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^ (57) Abstract: The present invention is directed to lecithin: cholesterol acyltransferase-like polypeptides (LCAT) and acyl CoA: 
^ cholesterol acyltransferases-like polypeptides (ACAT). The invention provides polynucleotides encoding such cholesterol: acyl- 

transferases-like polypepddes, polypeptides, encoded by such polynucleotides, and the use of such polynucleotides to alter sterol 
^ composition and oil production in plants and host cells. Also provided are oils produced by the plants and host cells containing the 

polynucleotides and food products, nuUitional supplements, and pharmaceutical composition containing plants or oils of the present 
^ invention. The polynucleotides of the present invention include those derived from plant sources. 
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PLANT STEROL ACYLTRANSFERASES 

CROSS-REFERENCE TO RELATED APPLICATIONS 

5 This application claims priority to U.S. provisional application Serial No. 

60/1 52,493, filed August 30, 1999 and herein incorporated by reference in its entirety for 
all purposes. 

BACKGROUND 

10 Technical Field ' ^ . 

The present invention is directed to plant acyltransferase-like nucleic acid and 
amino acid sequences and constructs, and methods related to their use in altering sterol 
composition and/or content, and oil composition and/or content in host cells and plants, 

15 Related Art 

Through the development of plant genetic engineering techniques, it is now possible 
to produce transgenic varieties of plant species to provide plants which have novel and 
desirable characteristics. For example, it is now possible to genetically engmeer plants for 
tolerance to environmental stresses^ such as resistance to pathogens and tolerance to 
2 0 herbicides. It is also possible to improve the nutritional characteristics of the plant, for 
example to provide improved fatty acid, carotenoid, sterol and tocopherol compositions. 
However, the number of useful nucleotide sequences for the engineering of such 
characteristics is thus far limited. 

There is a need for improved means to obtain or manipulate compositions of sterols 

2 5 from biosynthetic or natural plant sources. The ability to increase sterol production or alter 

the sterol compositions in plants may provide for novel sources of sterols for use in human 
and animal nutrition. 

Sterol biosynthesis branches from the famesyl diphosphate intermediate in the 
isoprenoid pathway. Sterol biosynthesis occurs via a mevalonate dependent pathway in 

3 0 mammals and higher plants (Goodwin,(1981) Biosynthesis of Isoprenoid Compounds, vol 

I (Porter, J.W. & Spurgeon, S.L., eds) pp.443-480, John Wiley and Sons, New Yoric), 
while in green algae sterol biosynthesis is thought to occur via a mevalonate independent 
pathway (Schwender, et al (1 997) Physiology, Biochemistry, and Molecular Biology of 
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Plant Lipids, (Williams, J.P., Khan, M.U., and Lem, N,W., eds) pp. 180-182, Kluwer 
Academic Publishers, Norwell, MA). 

The solubility characteristics of sterol esters suggests that this is the storage form of 
sterols (Chang, etaL, (1997) Amu, Rev. Biochem., 66:613-638). Sterol 0-acyltransferase 
5 enzymes such as acyl CoArcholesterol acyltransferase (ACAT) and lecithin:cholesterol 
acyltransferase (LCAT) catalyze the formation of cholesterol esters, and thus are key to 
controlling the intracellular cholesterol storage. In yeast, it has been reported that 
overexpression of LROl, a homolog of human LCAT, and phospholipid:diacylglycerol 
acyltransferase increased lipid synthesis (Oelkers et al, (2000) 1 Biol Chem., 26:15609- 

10 15612; Dahlqvist et aL, (2000) Proc, Natl Acad. Set USA, 97:6487-6492). 

The characterization of various acyltransferase proteins is useful for the further 
study of plant sterol synthesis systems and for the development of novel and/or alternative 
sterol sources. Studies of plant mechanisms may provide means to further enhance, 
control, modify, or otherwise alter the sterol composition of plant cells. Furthennore, such 

15 alterations in sterol content and/or composition may provide a means for obtaining 

tolerance to stress and insect damage. Of particular interest are the nucleic acid sequences 
of genes encoding proteins which may be useful for applications in genetic engineering. 

SUMMARY OF THE INVENTION 

2 0 The present invention is directed to lecithiri:cholesterol acyltransferase-like 

polypeptides (also referred to herein as LCAT) and acyl CoAxholesterol acyltransferase- 
like polypeptides (also referred to herein as ACAT), In particular the invention is related 
to polynucleotides encoding such sterol:acyltransferases, polypeptides encoded by such 
polynucleotides, and the use of such polynucleotides to alter sterol composition and oil 

25 production. The polynucleotides of the present invention include those derived from plant 
sources. 

One aspect of the invention, therefore^ is an isolated nucleic acid sequence 
encoding a plant lecithinxholesterol acyltransferase-like polypeptide, a fragment of a plant, 
lecithinxholesterol acyltransferase-like polypeptide, a plant acyl CoAxholesterol 
30 acyltransferase^ike polypeptide or a fragment of a plant acyl CoAxholesterol 
acyltransferase-like polypeptide. 

Another aspect provides an isolated nucleic acid sequence consisting essentially of 
SEQ ID NO: 2, 4, 6, 8, 10-29, 43-51, 73 or 75. Also provided is an isolated nucleic acid 
sequence consisting of SEQ ID NO: 2, 4, 6, 8, 10-29, 43-51, 73 or 75 . 
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Still another aspect provides an isolated nucleic acid sequence comprising a 
polynucleotide selected from the group consisting of an isolated polynucleotide encoding a 
polypeptide of SEQ ID NO: 3 or SEQ ID NO: 3 with at least one conservative amino acid 
substitution; SEQ ID NO: 2; an isolated polynucleotide that has at least 70%, 80%, 90%, 
5 or 95% sequence identity with SEQ ID NO: 2; an isolated polynucleotide of at least 10 
amino acids that hybridizes under stringent conditions to SEQ ID NO: 2; an isolated 
polynucleotide complementary to any of the foregoing; and an isolated polynucleotide that 
hybridizes under stringent conditions to SEQ ID NO: 2 and encodes a plant 
lecithin:chole$terol acyltransferase-like polypeptide, 

10 Still another aspect provides an isolated nucleic acid sequence consisting 

essentially of a polynucleotide of the formula 5* X-(R,V(R2)n-(R3)n-Y 3* where X is a 
hydrogen, Y is a hydrogen or a metal, R, and are any nucleic acid, n is an integer 
between 0-3000, and Rj is selected from the group consisting of an isolated polynucleotide 
encoding a polypeptide of SEQ ID NO: 3 or SEQ ID NO: 3 with at least one conservative 

1 5 amino acid substitution; SEQ ID NO: 2; an isolated polynucleotide that has at least 70%, 
80%, 90%, or 95% sequence identity with SEQ ID NO: 2; an isolated polynucleotide of at 
least 10 amino acids that hybridizes under stringent conditions to SEQ ID NO: 2; an 
isolated polynucleotide complementary to any of the foregoing; and an isolated 
polynucleotide that hybridizes under stringent conditions to SEQ ID NO: 2 and encodes a 

2 0 plant lecithin:cholesterol acyltransferase-like polypeptide. 

Another aspect provides an isolated nucleic acid sequence comprising a 
polynucleotide selected from the group consisting of an isolated polynucleotide encoding a 
polypeptide of SEQ ID N0:5 or SEQ ID NO: 5 with at least one conservative amino acid 
substitution; SEQ ID NO: 4; an isolated polynucleotide that has at least 70%, 80%, 90%, 
25 or 95% sequence identity with SEQ ID NO: 4; an isolated polynucleotide of at least 10 
amino acids that hybridizes under stringent conditions to SEQ ID NO: 4; an isolated 
polynucleotide complementaiy to any of the foregoing; and an isolated polynucleotide that 
hybridizes under stringent conditions to SEQ ID NO: 4 and encodes a plant 
lecithin:cholesterol acyltransferase-like polypeptide. 

3 a Another aspect provides an isolated nucleic acid sequence consisting essentially of 

a polynucleotide of the formula 5' X"(R,)„-(R2)„-(R3)^-Y 3* where X is a hydrogen, Y is a 
hydrogen or a metal, R, and Rj are any nucleic acid, n is an integer between 0-3000, and 
R2 is selected from the group consisting of an isolated polynucleotide encoding a 
polypeptide of SEQ ID NO: 5 or SEQ ID NO: 5 with at least one conservative amino acid 
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substitution; SEQ ID NO: 4; an isolated polynucleotide that has at least 70%, 80%. 90%, 
or 95% sequence identity with SEQ ID NO: 4; an isolated polynucleotide of at least 10 
amino acids that hybridizes under stringent conditions to SEQ ID NO: 4; an isolated 
polynucleotide complementary to any of the foregomg; and an isolated polynucleotide that 
5 hybridizes under stringent conditions to SEQ K) NO: 4 and encodes a plant 
lecithin:cholesteroI acyltransferase-like polypeptide. 

Another aspect provides an isolated nucleic acid sequence comprising a 
polynucleotide selected from the group consisting of an isolated polynucleotide encoding a 
polypeptide of SEQ ED N0:7 or SEQ ID NO: 7 with at least one conservative amino acid 

1 0 substitution; SEQ ID NO: 6; an isolated polynucleotide that has at least 70%, 80%, 90%, 
or 95% sequence identity with SEQ ID NO: 6; an isolated polynucleotide of at least 10 
amino acids that hybridizes under stringent conditions to SEQ ID NO: 6; an isolated 
polynucleotide complementary to any of the foregoing; and an isolated polynucleotide that 
hybridizes under stringent conditions to SEQ ID NO: 6 and encodes a plant 

15 lecithin:cholesterol acyltransferase-like polypeptide, 

Another aspect provides an isolated nucleic acid sequence consisting essentially of 
a polynucleotide of tihe fomiula 5' X-(R,)„-(R2)„-(R3VY 3' where X is a hydrogen, Y is a 
hydrogen or a metal, R, and Rj are any nucleic acid, n is an integer between 0^3000, and 
R2 is selected Scorn the group consisting of an isolated polynucleotide encoding a 

2 0 polypeptide of SEQ ID NO: 7 or SEQ ID NO: 7 with at least one conservative ammo acid 
substitution; SEQ ID NO: 6; an isolated polynucleotide that has at least 70%, 80%, 90%, 
or 95% sequence identity with SEQ ED NO: 6; an isolated polynucleotide of at least 10 
amino acids that hybridizes under stringent conditions to SEQ ID NO: 6; an isolated 
polynucleotide complementary to any of the foregoing; and an isolated polynucleotide that 

2 5 hybridizes under stringent conditions to SEQ ID NO: 6 and encodes ia plant 

lecithin:cholesterol acyltransferase-like polypeptide. 

Another aspect provides an isolated nucleic acid sequence comprising a . 
polynucleotide selected from the group consisting of an isolated polynucleotide encoding a 
polypeptide of SEQ ID N0:9 or SEQ ID NO: 9 with at least one conservative amino acid 

3 0 substitution; SEQ ID NO: 8; an isolated polynucleotide that has at least 70%, 80%, 90%, 

or 95% sequence identity with SEQ ID NO: 8; an isolated polynucleotide of at least 10 
amino acids that hybridizes imder stringent conditions to SEQ ID NO: 8; an isolated 
polynucleotide complementary to any of the foregoing; and an isolated polynucleotide that 
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hybridizes under stringent conditions to SEQ ID NO: 8 and encodes a plant 
lecithinxholesterol acyltransferase-like polypeptide. 

Another aspect provides an isolated nucleic acid sequence consisting essentially of 
a polynucleotide of the formula 5' X-(R,)n-(R2)„-(R3)n-y 3* where X is a hydrogen, Y is a 
5 hydrogen or a metal, R, and Rj are any nucleic acid, n is an integer between 0-3000, and 
Rj is selected from the group consisting of an isolated polynucleotide encoding a 
polypeptide of SEQ ID NO: 9 or SEQ ID NO: 9 with at least one conservative amino acid 
substitution; SEQ ID NO: 8; an isolated polynucleotide that has at least 70%, 80%, 90%, 
or 95% sequence identity with SEQ ID NO: 8; an isolated polynucleotide of at least 10 

10 amino acids that hybridizes under stringent conditions to SEQ ID NO: 8; an isolated 

polynucleotide complementary to any of the foregoing; and an isolated polynucleotide that 
hybridizes under stringent conditions to SEQ ID NO: 8 and encodes a plant 
lecithinrcholesterol acyltransferase-like polypeptide. 

Another aspect provides an isolated nucleic acid sequence comprising a 

1 5 polynucleotide selected from the group consisting of an isolated polynucleotide encoding a 
polypeptide of SEQ ID NO:74 or SEQ ID NO: 74 with at least one conservative amino 
acid substitution; SEQ ID NO: 73; an isolated polynucleotide that has at least 70%, 80%, 
90%, or 95% sequence identity with SEQ ID NO: 73; an isolated polynucleotide of at least 
10 amino acids that hybridizes under stringent conditions to SEQ ID NO: 73; an isolated 

2 b polynucleotide complementary to any of the foregoing; and an isolated polynucleotide that 
hybridizes under stringent conditions to SEQ ID NO: 73 and encodes a plant 
lecithin:cholesterol acyltransferase-like polypeptide. 

Another aspect provides an isolated nucleic acid sequence consisting essentially of 
a polynucleotide of the formula 5* X-<Rj V(R2)n-(R3)n'Y 3* where X is a hydrogen, Y is a 

2 5 hydrogen or a metal, Rj and Rj are any niicleic acid, n is an integer between 0-3000, and 
Rj is selected from the group consisting of an isolated polynucleotide encoding a 
polypeptide of SEQ ID NO: 74 or SEQ ID NO: 74 with at least one conservative amino 
acid substitution; SEQ ID NO: 73; an isolated polynucleotide that has at least 70%, 80%, 
90%, or 95% sequence identity with SEQ ID NO: 73; an isolated polynucleotide of at least 

30 10 amino acids that hybridizes under stringent conditions to SEQ ID NO: 73 ; an isolated 
polynucleotide complementary to any of the foregoing; and an isolated polynucleotide that 
hybridizes under stringent conditions to SEQ ID NO: 73 and encodes a plant 
. lecithin:cholesterol acyltransferase-like polypeptide. 
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Another aspect provides an isolated nucleic acid sequence comprising a 
polynucleotide selected from the group consisting of an isolated polynucleotide encoding a 
polypeptide of SEQ ID NO:76 or SEQ ED NO: 76 with at least one conservative amino 
acid substitution; SEQ ID NO: 75; an isolated polynucleotide that h2s at least 70%, 80%, 
5 90%, or 95% sequence identity with SEQ ID NO: 75; an isolated polynucleotide of at least 
10 amino acids that hybridizes under stringent conditions to SEQ ID NO: 75; an isolated 
polynucleotide complementary to any of the foregoing; and an isolated polynucleotide that 
hybridizes under stringent conditions to SEQ ID NO: 75 and encodes a plant 
lecithin:cholesterol acyltransferase-like polypeptide/ 

10 Another aspect provides an isolated nucleic acid sequence consisting essentially of 

a polynucleotide of the foimula 5' X-(R,)„-(R2)n"'(R3VY 3* where X is a hydrogen, Y is a 
hydrogen or a metal, R, and R;^ are any nucleic acid, n is an integer between 0-3000, and 
R2 is selected from the group consisting of an isolated polynucleotide encoding a 
polypeptide of SEQ ID NO: 76 or SEQ ID NO: 76 with at least one conservative amino 

15 acid substitution; SEQ ID NO: 75; an isolated polynucleotide that has at least 70%, 80%, 
90%, or 95% sequence identity with SEQ ID NO: 75; an isolated polynucleotide of at least 
1 0 amino acids that hybridizes under stringent conditions to SEQ ID NO: 75; an isolated 
polynucleotide corrq)lementary to any of the foregoing; and an isolated polynucleotide that 
hybridizes under stringent conditions to SEQ ID NO: 75 and encodes a plant 

2 0 lecithin:cholesterol acyltransferase-like polypeptide. 

Another aspect provides an isolated nucleic acid sequence comprising a 
polynucleotide selected from the group consisting of SEQ ID NO: 42 or a degenerate 
variant thereof; an isolated polynucleotide tiiat has at least 70%, 80%, 90%, or 95% 
sequence identity with SEQ ID NO: 42; an isolated polynucleotide of at least 10 amino 

2 5 acids that hybridizes under stringent conditions to SEQ ID NO: 42; an isolated 

polynucleotide complementary to any of the foregoing; and an isolated polynucleotide that 
hybridizes under stringent conditions to SEQ ID NO: 42 and encodes an acyl 
CoA:cholesterol acyltransferase-like polypeptide. 

Another aspect provides an isolated nucleic acid sequence consisting essentially of 

30 a polynucleotide of the fomiula 5' X-(R,)„-(R^„-(Rj)„-Y 3* where X is a hydrogen, Y is a 
hydrogen or a metal, R^ and R^ are any nucleic acid, n is an integer between 0-3000, and 
R^ is selected from the group consisting of SEQ ID NO: 42 or a degenerate variant thereof; 
an isolated polynucleotide that has at least 70%, 80%, 90%, or 95% sequence identity with 
SEQ ID NO: 42; an isolated polynucleotide of at least 10 ainino acids that hybridizes 
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under stringent conditions to SEQ ID NO: 42; an isolated polynucleotide complementary 
to any of the foregoing; and an isolated polynucleotide that hybridizes under stringent 
conditions to SEQ ID NO: 42 and encodes a acyl CoA:cholesterol acyltransferase-like . 
polypeptide. 

5 Also provided is a recombinant nucleic acid construct comprising a regulatory 

sequence operabiy linked to a polynucleotide encoding a lecithin:cholesterol 
acyltransferase-like polypeptide and/or an acyl CoA:cholesterol acyltransferase-like 
polypeptide. In one embodiment, the sterol acyl transferases are plant sterol acyl 
transferases. In another embodiment, the recombmant nucleic acid constructs further 

10 comprises a termination sequence. The regulatory sequence can be a constitutive 

promoter, an inducible promoter, a developmentally regulated promoter, a tissue specific 
promoter, an organelle specific promoter, a seed specific promoter or a combination of any 
of the foregoing. Also provided is a plant containing this recombinant nucleic acid 
construct and the seed and progeny from such a plant. This recombinant nucleic acid 

15 construct can also be introduced into a suitable host cell to provide yet another aspect of 
the invention. If the host cell is a plant host cell, the cell can be used to generate a plant to 
provide another aspect of the invention. Further aspects include seed and progeny from 
such a plant. 

Yet another aspect is a purified polypeptide comprising, SEQ ID NO: 3, SEQ ID 
20 NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 74, SEQ ID NO: 76, or any of the 
preceding sequences with at least one conservative amino acid substitution. 

Still another aspect provides a purified immunogenic polypeptide comprising at 
least 10 consecutive amino acids from an amino acid sequence selected from the group 
consisting of SEQ ID NO: 3, 5, 7, 9, 74, 76 and any of the preceding sequences containing 
25 at least one conservative amino acid substitution. Also provided are antibodies, either 

polyclonal or monoclonal, that specifically bind the preceding immunogenic polypeptides. 

One aspect provides a method for producing a lecithinrcholesterol acyltransferase- 
like polypeptide or an acyl CoA:cholesterol acyltransferase-like polypeptide comprising 
culturing a host cell containing any recombinant nucleic acid construct of the present 
3 0 invention under condition permitting expression of said lecithin:cholesterol 

acyltransferase-like polypeptide or acyl CoA:cholesterol acyltransferase-like polypeptide. 

Another aspect provides a method for modifying the sterol content of a host cell, 
comprising transforming a host cell with a recombinant construct containing a regulatory 
sequence operabiy linked to a polynucleotide encoding a lecithin:cholesterol 
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acyltransferase-like polypeptide and culturing said host cell xmder conditions wherein said 
host ceil expresses a lecithinicholesterol acyltransferase-like polypeptide such that said 
host cell has a modified sterol composition as compared'to host cells without the 
recombinant construct. 

5 An additional aspect is a method for modifying the sterol content of a host cell 

comprising transforming a host cell with a recombinant construct containing a regulatory 
sequence operabiy linked to a polynucleotide encoding an acyl CoAxholesterol 
acyltransferase-like polypeptide and culturing said host cell under conditions wherein said 
. . host cell expresses an acyl CoAxholesterol acyltransferase-like polypeptide such that said 
10 host cell has a modified sterol composition as compared to host cells without the 
recombinant construct, 

A fiirtber aspect is a plant comprising a recombinant construct containing a 
regulatory sequence operabiy linked to a polynucleotide encoding a lecithin:cholesterol 
acyltransferase-like polypeptide wherein expression of said recombinant construct results 
15 in modified sterol composition of said plant as compared to the same plaiit without said 
recombinant construct. 

Another aspect provides a plant comprising a recombinant construct containing a 
regulatory sequence operabiy linked to a polynucleotide encoding an acyl CoAxholesterol 
acyltransferase-like polypeptide wherein expression of said recombinant construct results 
20 in modified sterol composition of said plant as compared to the same plant without said 
recombinant construct* 

In a further aspect is provided an oil obtained firom any of the plants or host cells of 
the present invention. 

In still another aspect is provided a method for producing an oil with a modified 

2 5 sterol composition comprising providing any of the plants or host cells of the present 

invention and extracting oil fi-om the plant by any known method. Also provided is an oil 
produced by the preceding method. 

Still another aspect provides a method for altering oil production by a host cell 
comprising, transforming a host cell with a recombinant construct containing a regulatory 

3 0 sequence operabiy linked to a polynucleotide encoding a lecitfainxholesterol 

acyltransferase-like polypeptide and culturing the host cell under conditions wherein the 
host cell expresses a ledthinrcholesterol acyltransferase-like polypeptide such that the host 
cell has an altered oil production as compared to host cells without the recombinant . 
construct. 



wo 01/16308 PCT/tJSOO/23863 

9 

Another aspect provides a method for altering oil production by a host cell 
comprising, transforming a host cell with a recombinant construct containing a regulatory 
sequence operably linked to a polynucleotide encoding an acyl CoA:cholesterol 
acyltransferase*like polypeptide and culturing the host cell under conditions wherein the 
5 host cell expresses an acyl CoA:cholesterol acyltransferase-like polypeptide such that the 
host cell has an altered oi! production as compared to host cells without the recombinant 
construct. 

Also provided is a plant comprising a recombinant construct containing a 
regulatory sequence operably linked to a polynucleotide encoding a lecithin: cholesterol 
1 0 acyltransferase-like polypeptide wherein expression of said recombinant construct results 
in an altered production of oil by said plant as compared to the same plant without said 
recombinant construct. 

In a further aspect is provided a plant comprising a recombinant construct 
containing a regulatory sequence operably linked to a polynucleotide encoding an acyl 
1 5 CoAxholesterol acy Itransf^ase-like polypeptide wherein expression of said recombinant 
construct results in an altered production of oil by said plant as compared to the same plant 
without saidrecombinant construct 

Additional aspects provide a food, food ingredient or food product comprising any 
oil, plant or host cell of the present invention; a nutritional or dietary supplement 
2 0 comprising any oil, plant or host cell of the present invention; and a pharmaceutical 
composition comprising any oil, plant or host cell of the present invention along with a 
suitable diluent, carrier or excipient. 

Additional aspects will be apparent from the descriptions and examples that follow. 

2 5 BRIEF DESCRIPTION OF THE DRAWINGS 

These and other features, aspects, and advantages of the present invention will 
become better understood with regard to the following description, appended claims and 
accompanying figures where: 

Figure 1 shows an alignment of yeast, human and rat lecithinxholesterol 

3 0 acyltransferase protein sequences witii Arabidopsis LCATl , LCAT2, LCAT3, and LCAT4 

deduced amino acid sequences* 

Figure 2 shows the results of NMR sterol ester analysis on T2 seed from plant 
expressing LCAT4 under the control of the napin promoter (pCGN9998), 
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Figure 3 shows the results of HPLC/MS sterol analysis on oil extracted from T2 
seed from control lines (pCGN8640) and lines expressing LCAT3 (pCGN9968) under the 
control of the napin promoter. 

Figure 4 shows the results of HPLC/MS sterol analysis on oil extracted from T2 
5 seed from control lines (pCGN8640), and plant line expressing LCATl (pCGN9962), 
LCAT2 (pCGN9983), LCAT3 (pCGN9968), and LCAT4 (pCGN9998) under the control 
of the napin promoter Additionally, data from 3 lines expressing LCAT4 under the 
control of the 35S promoter (pCGN9996) are shown. 

Figure 5 shows the results of Nir analysis of the oil content of T2 seed from control 
1 0 lines (pCGN8640), and plant lines expressing LCATl (pCGN9962)^ LCAT2 

(pCGN9983), and LCAT3 (pCGN9968) under the control of the napin promoter. 
Additionally, data from 16 lines expressing LCAT2 xmder the control of the 35S promoter 
(pCGN9981) are shown. 

15 DETAILED DESCRIPTION ^ 

The following detailed description is provided to aid those skilled in the art in 
practicing the present invention. Even so» this detailed description should not be constmed 
to unduly limit the present invention as modifications and variations in the embodiments 
discussed herein can be made by those of ordinary skill in the art without departing from 

2 0 the spfrit or scope of the present inventive discovery. 

All publications, patents, patent applications and other references cited in this 
application are herein incorporated by reference in their entirety as if each individual 
publication, patent^ patent application or other reference were specifically and individually 
indicated to be incorporated by reference. 

25 The present invention relates to lecithinxholesterol acyltransferase, particularly the 

isolated nucleic acid sequences encoding lecithin:cholesterol-like polypeptides (LCAT) 
from plant sources and acyl CoA:cholesterol:acyltransferase, particularly the isolated 
nucleic acid sequences encoding acyl CoAxholesterol acyltransferase-like polypeptides 
(ACAT) from plant sources. Lecithinxholesterol acyltransferase-like as used herein 

30 includes any nucleic acid sequence encoding an amino acid sequence from a plant source, 
, such as a protein, polypeptide or peptide, obtainable from a cell source, which 
demonstrates the ability to utilize lecithin (phosphatidyl choline) as an acyl donor for 
acylation of sterols or glycerides to form esters under enzyme reactive conditions along 
with such proteins polypeptides and peptides. Acyl CoAxholesterol acyltransferase-like 
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as used herein includes any nucleic acid sequence encoding an amino acid sequence from a 
plant source, such as a protein, polypeptide or peptide, obtainable from a cell source, 
which demonstrates the ability to utilize acyl CoA as an acyl donor for acylation of sterols 
or glycerides to form esters under enzyme reactive conditions along with such proteins 
5 polypeptides and peptides. By "enzyme reactive conditions" is meant that any necessary 
conditions are available in an environment (i.e., such factors as temperature, pH, lack of 
inhibiting substances) which will permit the enzyme to function. 

The term "sterol" as apphed to plants refers to any chiral tetracyclic isopentenoid 
which may be formed by cyclization of squalene oxide through the transition state 

10 possessing stereochemistry similar to the trans-syn-trans-anti-trans-anti configuration, for 
example, protosteroid cation, and which retains a polar group at C-3 (hydroxyl or keto), an 
alUtrans-anti stereochemistry in the ring system, and a side-chain 20R-configuration 
(Parker, et al (1992) In Nes, et al, Eds,, Regulation of Isopentenoid Metabolism, ACS 
Symposium Series No, 497, p. 1 10; American Chemical Society, Washington, D.C.)- 

1 5 Sterols may or may not contain a C-5-C-6 double bond, as this is a feature 

introduced late in the bidsynthetic pathway. Sterols contain a Q-C^o side chain at the C-17 
position. 

The term **phytosterol," which applies to sterols found uniquely in plants, refers to 
a sterol containing a C-5, and in some cases a C-22, double bond. Phytosterols are fiirther 
2 0 characterized by alkylation of the C-l? side-chain with a methyl or ethyl substituent at the 
C-24 position. Major phjrtosterols include, but are not limited to, sitosterol, stigmasterol, 
campesterol, brassicasterol, etc. Cholesterol, which lacks a C-24 methyl or ethyl side- 
chain, is found in plants, but is not unique thereto, and is not a "phytosteroL" 

"Phytostanols** are saturated forms of phytosterols wherein the C-5 and, when 
25 present, C-22 double bond(s) is (are) reduced, and include, but are not limited to, 
sitostanol, campestanol, and 22-dihydrobrassicastanol. 

"Sterol esters" are fijrther characterized by the presence of a fatty acid or phenolic 
acid moiety rather than a hydroxyl group at the C-3 position. 

The term "sterol" includes sterols, phytosterols,. phytosterol esters, phytostanols, 
30 and phytostanol esters. 

The term "sterol compounds" includes sterols, phyotsterols, phytosterol esters, 
phytostanols, and phytostanol esters. 

The term "phytosterol compound" refers to at least one phytosterol, at least one 
phytosterol ester, or a mixture thereof 
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The term "phytostanol compound*' refers to at least one phytostanol, at least one 
phytostanol ester, or a mixture thereof. 

The term "glyceride** refers to a fatty acid ester of glycerol and includes mono-, di-, 
and tri- acylglycerols. 

5 As used herein, "recombinant construct" is defined either by its method of 

production or its structure. In reference to its method of production, e.g., a product made 
by a process, the process is use of recombinant nucleic acid techniques, e,g„ involving 
human intervention in the nucleotide sequence, typically selection or production. 
Alternatively, in terms of structure, it can be a sequence comprising fusion of two or more 
1 0 nucleic acid sequences which are not naturally contiguous or operatively linked to each 
other 

As used herein, regulatory sequence" means a sequence of DNA concerned with 
controlling expression of a gene; e.g. promoters, operators and attenuators. A 
"heterologous regulatory sequence" is one which differs fi-om the regulatory sequence 
1 5 naturally associated with a gene. ^ 

As used herein, "polynucleotide" and "oligonucleotide" are used interchangeably 
and mean a polymer of at least two nucleotides joined together by a phosphodiester bond 
and may consist of either ribonucleotides or deoxynucleotides- 

As used herein, " sequence" means the linear order in which monomers appear in a 
20 polymer, for example, the order of amino acids in a polypeptide or the order of nucleotides 
in a polynucleotide. 

As used herein, "polypeptide" , "peptide", and "protein" are used interchangeably 
and mean a compound that consist of two or more amino acids that are linked by means of 
peptide bonds. 

25 As used herein, the terms "complementary" or "complementarity" refer to the 

pairing of bases, purines and pyrimidines, that associate through hydrogen bonding in 
double stranded nucleic acids. For example, the following base pairs are complementary: 
guanine and cytosine; adenine and thymine; and adenine and uracil. The terms, as used 
herein, include complete and partial complementarity. 
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Isolated proteins. Polypeptides and Polynucleotides 

A first aspect of the present invention relates to isolated LCAT polynucleotides. 
The polynucleotide sequences of the present invention include isolated polynucleotides 
that encode the polypeptides of the invention having a deduced amino acid sequence . 
5 selected from the group of sequences set forth in the Sequence Listing and to other 
polynucleotide sequences closely related to such sequences and variants thereof. 

The invention provides a polynucleotide sequence identical over its entire length to 
each coding sequence as set forth in the Sequence Listing. The invention also provides the 
coding sequence for the mature polypeptide or a fragment thereof, as well as the coding 
10 sequence for the mature polypeptide or a fragment thereof in a reading frame with other 
coding sequences, such as those encoding a leader or secretory sequence, a pre-, pro-, or 
prepro- protein sequence. The polynucleotide can also include non-coding sequences, 
. including for example, but not limited to, non-coding 5' and 3* sequences, such as the 
transcribed, untranslated sequences, termination signals, ribosome binding sites, sequences 
15 that stabilize mRNA, introns, polyadenylation signals, and additional coding sequence that 
encodes additional amino acids. For example, a marker sequence can be included to 
facilitate the purification of the fiised polypeptide. Polynucleotides of the present 
invention also include polynucleotides comprising a structural gene and the naturally 
associated sequences that control gene expression. 

2 0 The invention also includes polynucleotides of the formula: 

X.(R0„-(R.>(R3)„-Y 

wherein, at the 5' end, X is hydrogen, and at the 3' end, Y is hydrogen or a metal, R, and 
R3 are any nucleic acid residue, n is an integer between 0 and 3000, preferably between 1 
and 1000 and is a nucleic acid sequence of the invention, particularly a nucleic acid 
25 sequence selected from the group set forth in the Sequence Listing and preferably SEQ ID 
NOs: 2, 4, 6, 8, 10-29, 33, 42-51, 73 and 75. In the foraiula, Rj is oriented so that its 5' 
end residue is at the left, bound to R„ and its 3' end residue is at the right, bound to Ry 
Any stretch of nucleic acid residues denoted by either R group, where R is greater than 1, 
may be either a heteropolymer or a homopolymer, preferably a heteropolymer. 

3 0 The invention also relates to variants of the polynucleotides described herein that 

encode for variants of the polypeptides of the.invention. Variants that are fragments of the 
polynucleotides of the invention can be used to synthesize full-length polynucleotides of 
the invention: Preferred embodiments are polynucleotides encoding polypeptide variants 
wherein 5 to 10, 1 to 5, 1 to 3, 2, 1 or no amino acid residues of a polypeptide sequence of 
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the invention are substituted, added or deleted, in any combination. Particularly preferred 
are substitutions, additions, and deletions that are silent such that they do not alter the 
properties or activities of the polynucleotide or polypeptide. 

Further preferred embodiments of the invention that are at least 50%, 60%, or 70% 
5 identical over their entire length to a polynucleotide encoding a polypeptide of the 
invention, and polynucleotides that are complementary to such polynucleotides. More 
preferable are polynucleotides that comprise a region that is at least 80% identical over its 
entire length to a polynucleotide encoding a polypeptide of the invention and 
polynucleotides that are complementary thereto. In this regard, polynucleotides at least 

1 0 90% identical over their entire length are particularly preferred, those at least 95% 
identical are especially preferred. Further, those with at least 97% identity are highly 
preferred and those with at least 98% and 99% identity are particularly highly preferred, 
with those at least 99% being the most highly preferred. 

Preferred embodiments are polynucleotides^ that encode polypeptides that retain 

1 5 substantially the same biological function or activity as determined by the methods 

described h^ein as the mature polypeptides encoded by the polynucleotides setfortb in the 
Sequence Listing. 

The invention further relates to polynucleotides that hybridize to the above- 
described sequences. In particular, the invention relates to polynucleotides that hybridize 

2 0 under stringent conditions to the above-described polynucleotides. An example of 

stringent hybridization conditions is overnight incubation at 42^C in a solution comprising 
50% formamide, 5x SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium 
phosphate (pH 7,6), 5x Denhardt's solution, 10% dextran sulfate, and 20 
micrograms/milliliter denatured, sheared sahnon sperm DNA, followed by washing the 
25 hybridization support in O.lx SSC at approximately 65^C. Also included are 

polynucleotides that hybridize under a wash stringency of O.IX SSC or 0,1X SSPE (at 
50^C. Other hybridization and wash conditions are well known and are exemplified in 
Sambrook, et al. Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring 
Harbor, NY (1989)^ particularly Chapter 1 L 

3 0 The invention also provides a polynucleotide consisting essentially of a 

polynucleotide sequence obtainable by screening an appropriate library containing the 
complete gene for a polynucleotide sequence set for in the Sequence Listing under 
stringent hybridization conditions with a probe having the sequence of said polynucleotide 
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sequence or a fragment thereof; and isolating said polynucleotide sequence. Methods for 
screening libraries are well known in the art and can be found for example in Sambrook, et 
al. Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, NY 
(1989), particularly Chapter 8 and Ausubel et al,, Short Protocols in Molecular Biology, 
5 3"* ed, Wiley and Sons, 1995, chapter 6, Nucleic acid sequences useful for obtaining such 
a polynucleotide include, for example, probes and primers as described herein and in 
particular SEQ ID NO: 2, 4, 6, 8, 10-29, 33, 42-51, 73 and 75. These sequences are 
particularly useful in screening libraries obtained from Arabidopsis, soybean and com for 
sequences encoding lecithinxholesterol acyltransferase and lecithinxholesterol 
10 acyitransferase-like polypeptides and for screening libraries for sequences encoding acyl 
CoAxholesterol acyl transferase and acyl CoAxholesterol acyl transferase-like 
polypeptides. 

As discussed herein regarding polynucleotide assays of the invention, for example, 
polynucleotides of the invention can be used as a hybridization probe for RNA, cDNA, or 

1 5 genomic DNA to isolate fixll length cDNAs or genomic clones encoding a polypeptide and 
to isolate cDNA or genomic clones of other genes that have a high sequence similarity to a 
polynucleotide set forth in the Sequence Listing and in particular SEQ ID NO; 2, 4, 6, 8, 
10-29, 33, 42-51, 73 and 75. Such probes will generally comprise at least 15 bases. 
Preferably such probes will have at least 30 bases and can have at least 50 baj^es. 

2 0 Particularly preferred probes will have between 30 bases and 50 bases, inclusive. 

The coding region of each gene that comprises or is comprised by a polynucleotide 
sequence set forth in the Sequence Listing may be isolated by screening using a DNA 
sequence provided in the Sequence Listing to synthesize an oligonucleotide probe. A 
labeled oligonucleotide having a sequence complementary to that of a gene of the 

2 5 invention is then used to screen a library of cDNA, genomic DNA or mRNA to identify 
members ofthe library which hybridize to the probe. For example, synthetic 
oUgonucleotide3 are prepared which correspond to the LCAT EST sequences. The 
oUgonucleotides are used as primers in polymerase chain reaction (PGR) techniques to 
obtain 5* and 3' terminal sequence of LCAT genes. Alternatively, where oligonucleotides 

30 of low degeneracy can be prepared from particular LCAT peptides, such probes may be 
used directly to screen gene libraries for LCAT gene sequences. In particular, screening of 
cDNA libraries in phage vectors is usefiil in such methods due to lower levels of 
background hybridization. 
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Typically, a LCAT sequence obtainable &om the use of nucleic acid probes will 
show 60-70% sequence identity between the target LCAT sequence and the encoding 
sequence used as a probe. However, lengthy sequences with as little as 50-60% sequence 
identity may also be obtained. The nucleic acid probes may be a lengthy fragment of the 
5 nucleic acid sequence, or may also be a shorter, oligonucleotide probe. When longer 
nucleic acid fragments are employed as probes (greater than about 100 bp), one may 
screen at lower stringencies in order to obtain sequences from the target sample which 
have 20-50% deviation (i.e., 50-80% sequence homology) from the sequences used as 
probe. Oligonucleotide probes can be considerably shorter than the entire nucleic acid 

10 sequence encoding an LCAT enzyme, but should be at least about 10, preferably at least 
about 15, and more preferably at least about 20 nucleotides. A higher degree of sequence 
identity is desired when shorter regions are used as opposed to longer regions. It may thus 
be desirable to identify regions of highly conserved amino acid sequence to design 
oligonucleotide probes for detecting and recovering other related LCAT genes. Shorter 

1 5 probes are often particularly useful for polymerase chain reactions (PGR), especially when 
highly conserved sequences can be identified. (Sfee, Gould, et aLy PNAS USA (1989) 
55:1934-1938.). 

Another aspect ofthe present invention relates to LCAT polypeptides. Such 
polypeptides include isolated polypeptides set forth in the Sequence Listing, as well as 
2 0 polypeptides and fragments thereof, particularly those polypeptides which exhibit LCAT 
activity and also those polypeptides which have at least 50%, 60% or 70% identity, • 
preferably at least 80% identity, more preferably at least 90% identity, and most preferably 
at least 95% identity to a polypeptide sequence selected from the group of sequences set 
forth in the Sequence Listing, and also include portions of such polypeptides, wherein 

2 5 such portion ofthe polypeptide preferably includes at least 30 amino acids and more 

preferably includes at least 50 amino acids. 

"Identity", as is well understood in the art, is a relationship between two or more 
polypeptide sequences or two or more polynucleotide sequences, as detemiined by 
comparing the sequences. In the art, "identity"' also means the degree of sequence 

3 0 relatedness between polypeptide or polynucleotide sequences, as determined by the match 

between strings of such sequences. "Identity*' can be readily calculated by known methods 
including, but not limited to, those described in Computational Molecular Biology^ Lesk, 
A.M,, ed., Oxford University Press, New York (1988); Biocomputing: Informatics and 
Genome Projects, Smith, D.W., ed., Academic Press, New York, 1993; Computer Analysis 
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of Sequence Data, Part /, Griffin, A.M. and Griffin, H.G., eds., Humana Press, New Jersey 
(1994); Sequence Analysis in Molecular Biology, von Heinje, G., Academic Press (1987); 
Sequence Analysis Primery Gribskov, M, and Devereux, J,, eds*, Stockton Press, New 
York (1991); and Carillo, K, and Lipman, D., SIAM J Applied Math, 48:1073 (1988). 
5 Methods to determine identity are designed to give the largest match between the 
sequences tested. Moreover, methods to determine identity are codified in publicly 
available programs. Computer programs which can be used to determine identity between 
two sequences include, but are not limited to, GCG (Devereux, J., et al.. Nucleic Acids 
Research 12(1):387 (1984); suite of five BLAST programs, three designed for nucleotide 

1 0 sequences queries (BLASTN, BLASTX, and TBLASTX) and two designed for protein 
sequence queries (BLASTP and TBLASTN) (Coulson, Trends in Biotechnology. 12: 76- 
80 (1994); Birren, et aL Genome Analysis, 1: 543-559 (1997)). The BLAST X program is 
publicly available from NCBI and other sources {BMST ManmK MischvA, S., et al, 
NCBI NLM NIH, Bethesda, MD 20894; Altschul, S,, et aL, 1 Mol BioL, 215:403-410 

1 5 (1990)). The well known Smith Waterman algorithm can also be used to detemune 
identity. 

Parameters for polypeptide sequence comparison typically include the foUowinjg: 
Algorithm: Needleman and Wunsch» J. Mol. Biol 48:443-453 (1970) 
Comparison matrix: BLOSSUM62 from Hentikoff and Hentikoflf» Proc. Natl 
2 0 Acad. Sci USA 89:10915-10919 (1992) 
Gap Penalty: 12 
Gap Length Penalty: 4 

A program which can be used with these parameters is publicly available as the 
"gap" program from Genetics Computer Group, Madison Wisconsin. The above 
2 5 parameters along with no penalty for end gap are the defaxilt parameters for peptide 
comparisons. 

Paramet^ for polynucleotide sequence comparison include the following: 
Algorithm: Needleman and Wunsch, J. MoL BioL 48:443-453 (1970) 
Comparison matrix: matches = +10; mismatches = 0 
30 Gap Penalty: 50 

Gap Length Penalty: 3 

A program which can be txsed with these parameters is publicly available as the 
"gap" program from Genetics Computer Group, Madison Wisconsin. The above 
parameters are the default parameters for nucleic acid comparisons. 
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The invention also includes polypeptides of the formula: 
X.(R,V(R,)-(R3)„-Y 

wherein, at the amino temxinus, X is hydrogen, and at the carboxyl terminus, Y is 
hydrogen or a metal, R, and R3 are any amino acid residue, n is an integer between 0 and 
5 1 000, and Rj is an amino acid sequence of the invention, particularly an amino acid 

sequence selected from the group set forth in the Sequence Listing and preferably SEQ ID 
NOs: 3, 5, 7, 9, 74 and 76. In the formula, Rj is oriented so that its amino terminal residue 
is at the left, bound to R,, and its carboxy terminal residue is at the right, bound to R3. 
Any stretch of amino acid residues denoted by either R group, where R is greater than 1, 

1 0 may be either a heteropolymer or a homopolymer, preferably a heteropolymer. 

Polypeptides of the present invention include isolated polypeptides encoded by a 
polynucleotide comprising a sequence selected from the group of a sequence contained in 
SEQ ID NOs: 2, 4, 6, 8, 73 and 75, 

The polypeptides of the present invention can be mature protein or can be part of a 

15 fusion protein. 

Fragments and variants of the polypeptides are also considered to be a part of the 
invention. A fragment is a variant polypeptide which has an amino acid sequence that is 
entirely the same as part but not all of the amino acid sequence of the previously described 
polypeptides. The fragments can be "free-standing" or comprised within a larger 

2 0 polypeptide of which the fragment forms a part or a region, most preferably as a single 
continuous region. Preferred fragments are biologically active fragments which are those 
fragments that mediate activities of the polypeptides of the invention, including those with 
similar activity or improved activity or with a decreased activity. Also included are those 
polypeptides and polypeptide fragments that are antigenic or inununogenic in an animal, 

25 particularly a human and antibodies, either polyclonal or monoclonal that specifically. bind 
the antigenic fragments. In one preferred embodiment, such antigenic or immunogenic 
fragments comprise at least 10 consecutive amino acids from the amino acid sequences 
disclosed herein or such sequences with at least one conservative amino acid substitution. 
In additional embodiments, such antigenic or immunogenic fragments comprise at least 

30 IS, at least 25, at least 50 or at least 100 consecutive amino acids from the amino acid 
sequences disclosed herein or such sequences with at least one conservative amino acid 
substitutipn. Methods for the production of antibodies from polypeptides and 
polypeptides conjugated to cdnier molecules are well known in the art and can be foimd 
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for example in Ausubel et al.. Short Protocols in Molecular Biology, 3"* ed,, Wiley & 
Sons, 1995, particularly chapter 11. 

Variants of the polypeptide also include polypeptides that vaiy from the sequences 
set forth in the Sequence Listing by conservative amino acid substitutions, substitution of 
5 a residue by another with like characteristics* Those of ordinary skill in the art are aware 
that modifications in the amino acid sequence of a peptide, polypeptide, or protein can 
result in equivalent, or possibly improved, second generation peptides, etc, that display 
equivalent or superior functional characteristics when compared to the original ammaacid 
sequence. The present invention accordingly encompasses such modified amino acid 

10 sequences. Alterations can include amino acid insertions, deletions, substitutions, 

truncations, fusions, shuffling of subunit sequences, and the like, provided that the peptide 
sequences produced by such modifications have substantially the same functional 
properties as the naturally occurring coxmterpart sequences disclosed herein. 

One factor that can be considered in making such changes is the hydropathic index 

15 of amino acids. The importance of the hydropathic amino acid index in conferring 

int^ctive biological function on a protein has been discussed by Kyte and Doolittle ( /. 
MoL BioL, 157: 105-132, 1982), It is accepted that the relative hydropathic character of 
amino acids contributes to the secondary structure of the resultant protein* This, in tum» 
affects the interaction of the protein with molecules such as enzymes, substrates, receptors, 

20 DNA, antibodies^ antigens, etc. 

Based on its hydrophobicity and charge characteristics, each amino acid has been 
assigned a hydropathic index as follows: isoleucine (+4.5); valine (+4.2); leucine (+3.8); 
phenylalanine (+2.8); cysteine/cystine (+2.5); methionine (+1 .9); alanine (+1 .8); glycine (- 
0.4); threonine (-0.7); serine (-0.8); tryptophan (-0.9); tyrosine (-1.3); proline (-1.6); 

2 5 histidine (-3.2); glutamate/glutamine/aspartate/asparagine (-3.5); lysine (-3.9); and 

arginine (-4.iJ). 

As is known in the art, certain amino acids in a peptide or protein can be 
substituted for other amino acids having a similar hydropathic index or score and produce 
a resultant peptide or protein having similar biological activity, i,e., which still retains 

3 0 biological functionality* In making such changes, it is preferable that amino acids having 

hydropathic indices within ±2 are substituted for one another. More prefenred 
substitutions are those wherein the amino acids have hydropathic indices within il . Most 
preferred substitutions are those wherein the amino acids have hydropathic indices within 
±0.5. 
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Like amino acids can also be substituted on the basis of hydrophilicity. U.S. Patent 
No, 4,554,101 discloses that the greatest local average hydrophilicity of a protein, as 
governed by the hydrophilicity of its adjacent amino acids, coirelates with a biological 
property of the protein. The following hydrophilicity values have been assigned to amino 
5 acids: arginine/lysine (+3.0); aspartate/glutamate (+3.0 dbl); serine (+0.3); 
asparagine/glutamine (+0.2); glycine (0); threonine (-0.4); proline (-0.5 ±\); 
alanine/histidine (-0.5); cysteine (-1.0); methionine (-1.3); valine (-1.5); leucme/isoleucine 
(-L8); tyrosine (-23); phenylalanine (-2.5); and tryptophan (-3.4). Thus, one amino acid 
in a peptide, polypeptide, or protein can be substituted by another amino acid having a 

1 0 similar hydrophilicity score and still produce a resultant protein having similar biological 
activity, i.e., still retaming correct biological function. In making such changes, amino 
acids having hydropathic indices within ±2 are preferably substituted for one another, 
those within are more preferred, and those within ±0.5 are most preferred. 

As outlined above, amino acid substitutions in the peptides of the present invention 

1 5 can be based on the relative similarity of the amino acid side-chain substituents, for 

example, their hydrophobicity, hydrophilicity, charge, size, etc. Exemplary substitutions 
that take various of the foregoing characteristics into consideration in order to produce 
conservative amino acid changes resulting in silent changes within the present peptides, 
etc., can be selected from other members of the class to which the naturally occurring 

2 0 amino acid belongs. Amino acids can be divided into the following four groups: (1) acidic 
amino acids; (2) basic amino acids; (3) neutral polar amino acids; and (4) neutral non- 
polar amino acids. Representative amino acids within these various groups include, but 
are not limited to: (1) acidic (negatively charged) amino acids such as aspartic acid and 
glutamic acid; (2) basic (positively charged) amino acids such as arginine, histidine, and 

2 5 lysine; (3) neutral polar amino acids such as glycme, serine, threonine, cysteine, cystine, 

tyrosine, asparagine,-ahd glutamine; and (4) neutral non-polar amino acids such as alanine, 
leucine, isoleucine, valine, proline, phenylalanine, tryptophan, and methionine. It should 
be noted that changes which are not expected to be advantageous can also be useful if 
these result in the production of functional sequences. 

3 0 Variants that are fragments of the polypeptides of the invention can be used to 

produce the corresponding full length polypeptide by peptide synthesis. Therefore, these 
variants can be used as intermediates for producing the full-length polypeptides of the 
invention. 
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The polynucleotides and polypqjtides of the invention can be used, for example, in 
the transfonnation of host cells, such as plant cells, animal cells, yeast ceils, bacteria, 
bacteriophage, and viruses, as further discussed herein. 

The invention also provides polynucleotides that encode a polypeptide that is a 
5 mature protein plus additional amino or carboxyl-terminal amino acids, or amino acids 
within the mature polypeptide (for example, when the mature form of the protein has more 
than one polypeptide chain). Such sequences can, for example, play a role in the 
processing of a protein from a precursor to a mature form, allow protein transport, shorten 
or lengthen protein half-life, or facilitate manipulation of the protein in assays or 
1 0 production. It is contemplated that cellular enzymes can be used to remove any additional 
amino acids firom the mature protein. 

A precursor protein, havinjg the mature form of the polypeptide fused to one cm: 
more prosequences may be an inactive form of the polypeptide. The inactive precursors 
generally are activated when the prosequences are removed. Some or all of the 
1 5 prosequences may be removed, prior to activation. Such precursor protein are generally 
called proproteins. 

Preparation of Expression Constructs and Methods of Use 

Of interest is the use of the nucleotide sequences in recombinant DNA constructs 
20 to direct the transcription or transcription and translation (expression) of the 

acyltransferase sequences of the present invention in a host cell. Of particular interest is . 
the use of the polynucleotide sequences of the present invention in recombinant DNA 
constructs to direct the transcription or transcription and translation (expression) of the 
acyltransferase sequences of the present invention in a host plant cell. 

2 5 The expression constructs generally comprise a regulatory sequence functional in a 

host cell operably linked to a nucleic acid sequence encoding a lecithinxholesterol 
acyltransferase-like polypeptide or acyl CoArcholesterol acyltransferase-like polypeptide 
of the present invention and a transcriptional termination region functional in a host plant 
cell. Of particular interest is the use of promoters (also referred to as transcriptional 

3 0 initiation regions) functional in plant host cells. 

Those skilled in the art will recognize that there are a number of promoters which 
are functional in plant cells, and have been described in the literature including 
constitutive, inducible, tissue specific, organelle specific, developmentally regulated and 
environmentally regulated promoters. Chloroplast andplastid specific promoters. 
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chloroplast or plastid functional promoters, and chloroplast or plastid operable promoters 
are also envisioned. 

One set of promoters are constitutive promoters such as the CaMV35S or FMV35S 
promoters that yield high levels of expression in most plant organs. Enhanced or 
5 duplicated versions of the CaMV35S and FMV35S promoters are useful in the practice of 
this invention (Odell, et aL (1985) Nature 313:810-812; Rogers, U.S. Patent Number 
5,378, 619). Other useful constitutive promoters include, but are not limited to, the 
mannopine synthase {mas) promoter, the nopaline synthase {nos} promoter, and the 
octopine synthase (oc^) promoter. 

10 Useful inducible promoters include heat-shock promoters (Ou-Lee et al. (1986) 

Proc. Natl Acad. Set USA 83: 6815; Ainley et al (1990) Plant Mol Biol 14: 949), a 
nitrate-ihducible promoter derived from the spinach nitrite reductase gene (Back et al. 
(1991) Plant Mol Biol 17: 9), hormone-inducible promoters (Yamaguchi-Shinozaki et al. 
(1990) Plant Mol Biol 15: 905; Kares et aL (1990) Plant Mol Biol 15: 905), and 

1 5 light-inducible promoters associated with the small subunit of RuBP carboxylase and 
LHCP gene families (Kuhlemeier et al. (1989) Plant Cell 1 : 471; Feinbaum et al. (1991) 
Mol Gen. Genet 226: 449; Weisshaar et al. (1991) EMBO J. 10: 1777; Lam and Chua 
(1990) Science 248: 471; Castres'ana et al. (1988) EMBO J. 7: 1929; Schulze-Lefert et al 
(1 989) £A/50/. 8:651). 
.20 In addition, it may also be preferred to bring about expression of the 

acyltransferase gene in specific tissues of the plant, such as leaf, stem, root, tuber, seed, 
fiiiit, etc., and the promoter chosen should have the desired tissue and developmental 
specificity. Examples of useful tissue-specific, developmentally-regulated promoters 
include fruit-specific promoters such as the E4 promoter (Cordes et al. (1989) Plant Cell 

25 1:1025), the E8 promoter (Deikman et al. (1988) EMBO J. 7: 3315), the kiwifiruit 

actinidin promoter (Lin et al. (1993) PNAS 90: 5939), the 2A1 1 promoter (Houck et al, 
U.S. Patent 4,943,674), and the tomato pZ130 promoter (U.S. Patents 5,175, 095 and 
5,530,185); the p-conglycmin 7S promoter (Doyle et al. (1986) 1 Biol Chem. 261 : 9228; 
Slighton and Beachy (1987) Planta 172: 356), and seed-specific promoters (Knutzon et al. 

30 (1992) Proa Natl Acad. Sci USA 89: 2624; Bustos et al. (1991) £MBO/. 10: 1469;Lam 
and Chua (1991) J. Biol Chem. 266: 17131; Stayton et al. (1991) Aust 1 Plant, Physiol 
18: 507). Fruit-specific gene regulation is discussed in U.S. Patent'5,753,475. Other 
useful seed-specific promoters include, but are not limited to, the napin, phaseoUn, zem, 
soybean trypsin inhibitor, 7S, ADR12, ACP, stearoyl-ACP desaturase, oleosin, 



wo 01716308 PCT/USOO/23863 

23 

Lasquerella hydroxylase, and barley aldose reductase promoters (Battels (1995) Plant J. 7: 
809-822), the EA9 promoter (U.S. Patent 5,420,034), and the Bce4 promoter (U.S. Patent 
5,530,194). Useful embryo-specific promoters include the com globulin 1 and oleosin 
promoters. Useful endosperm-J5)ecific promoters include the rice glutelin-1 promoter, the 
5 promoters for the low-pl p amylase gene (Amy32b) (Rogers et al. (1984) / Biol Chem. 
259: 12234), the high-pl p amylase gene (Amy 64) (Khurseed et al. (1988) J. Biol Chem. 
263: 18953), and the promoter for a barley thiol protease gene ("Aleurain") (Whittier et al 
(1987) Nucleic Acids Res. 15: 2515), 

Of particular interest is the expression of the nucleic acid sequences of the present 

1 0 invention fi-om transcription initiation regions which are preferentially expressed in a plant 
seed tissue. Examples of such seed preferential transcription initiation sequences include 
those sequences derived from sequences encoding plant storage protein genes or from 
genes involved in fatty acid biosynthesis, in oilseeds. Examples of such promoters include 
the 5' regulatory regions from such genes as napin (Kridl et al. Seed Scl Res. 7:209:219 

15 (1991)), phaseolm» zein, soybean trypsin inhibitor, ACP, stearoyl-ACP desaturase, 

soybean a' subunit of p-conglycinin (soy 7s, (Chen et al, Proc. Natl Acad, Sci., 83:8560- 
8564 (1986))) and oleosin. Seed-specific gene regulation is discussed in EP 0 255 378 Bl 
and U.S. Patents 5,420,034 and 5,608,152 . Promoter hybrids can also be constructed to 
enhance transcriptional activity (Hofftnan, U.S. Patent No. 5,106,739), or to combine 

2 0 desired transcriptional activity and tissue specificity. 

. It may be advantageous to direct the localization of proteins conferring LCAT to a 
particular subcellular compartment, for example, to the mitochondrion, endoplasmic 
reticulum, vacuoles, chloroplast or other plastidic compartment. For example, where the 
genes of interest of the present invention will be targeted to plastids, such as chloroplasts, 

2 5 for expression, the constructs will also employ the use of sequences to direct the gene to 

the plastid. Such sequences are referred to herein as chloroplast transit peptides (CTP) or 
plastid transit peptides (PTP), In this manner, where the gene of interest is not directly 
inserted into the plastid, the expression construct will additionally contain a gene encoding 
a transit peptide to direct the gene of interest to the plastid. The chloroplast transit 

3 0 peptides may be derived from the gene of interest, or may be derived from a heterologous 

sequence having a CTP. Such transit peptides are known in the art. See, for example, Von 
Heijnee/a/. (1991) Plant Mol Biol Rep. 9:104-126; Clark era/. (1989)7. Biol Chem. 
25-^:17544-17550; della-Cioppa et al (1987) Plant Physiol 5^:965-968; Romer et al 
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(1993) Biochem, Biophys. Res Commun. 796:1414-1421; and, Shah etal. (19S6} Science 
253:478-481. 

Depending upon the intended use, the constructs may contain the nucleic acid 
sequence which encodes the entire LCAT protein, a portion of the LCAT protein, the 
5 entire ACAT protem, or a portion of the ACAT protein^ For example, where antisense 
inhibition of a given LCAT or ACAT protein is desired, the entire sequence is not 
required. Furthermore, where LCAT or ACAT sequences used in constructs are intended 
for use as probes, it may be advantageous to prepare constructs containing only a 
particular portion of a LCAT or ACAT encoding sequence, for example a sequence which 

10 is discovered to encode a highly conserved region. 

The skilled artisan will recognize that there are various methods for the inhibition 
of expression of endogenous sequences in a host cell. Such methods include, but are not 
limited to antisense suppression (Smith, et al (1988) Nature 334:724-726) , co- 
suppression (Napoli, ^^ a/. (1989)P/a/i/Ce// 2:279-289), ribozymes (PCX Publication 

15 WO 97/10328), and combinations of sense and antisense Waterhouse, et al (l;998)jProc, 
Natl Acad. Set USA 95:13959-13964. Methods for the suppression of endogenous 
sequences in a host cell typically employ the transcription or transcription and translation 
of at least a portion of the sequence to be suppressed. Such sequences may be homologous 
to coding as well as non-coding regions of the endogenous sequence. 

2 0 Regulatory transcript termination regions may be provided in plant expression 

constructs of this invention as well Transcript termination regions may be provided by 
the DNA sequence encoding the diacylglycerol acyltransferase or a convenient 
transcription termination region derived from a different gene source, for example, the 
transcript termination region which is naturally associated with the transcript initiation 

2 5 region. The skilled artisan will recognize that any convenient transcript termination region 
which is capable of terminating transcription in a plant cell may be employed in the 
constructs of the present invention. 

Alternatively, constructs may be prepared to direct the expression of the LCAT or 
ACAT sequences directly from the host plant cell plastid. Such constructs and methods 

30 are known in the art and are generally described, for example, in Svab, et al (1 990) Proc. 
Natl Acad. Sci USA 87:8526-8530 and Svab and Maliga (1993)Prac. Natl Acad. ScL 
USA 90:913-917 and m U.S. Patent Number 5,693,507, 

A plant cell, tissue, organ, or plant into which the recombinant DNA constructs 
containing the expression constructs have been introduced is considered transformed. 
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transfected, or transgenic. A transgenic or transformed cell or plant also includes progeny 
of the cell or plant and progeny produced fix>m a breeding program employing such a 
transgenic plant as a parent in a cross and exhibiting an altered phenotype resulting from 
the presence of a LCAT nucleic acid sequence. 
5 Plant expression or transcriiption constructs having a plant LCAT as the DNA 

sequence of interest for increased or decreased expression thereof may be employed with a 
wide variety of plant life, particularly, plant life involved in the production of vegetable 
oils for edible and industrial uses. Most especially preferred are temperate oilseed crops. 
Plants of interest include, but are not limited to, rapeseed (Canola and High Erucic Acid 

10 varieties), sunflower, safflower, cotton, soybean, peanut, coconut and oil palms, and com. 
Depending on the method for introducing the recombinant constructs into the host cell, 
other DNA sequences may be required. Importantly, this invention is applicable to 
dicotyledyons and monocotyledons species alike and will be readily applicable to new 
and/or improved transformation and regulation techniques. 

15 Of particular interest, is the use of plant LCAT and ACAT constructs in plants to 

produce plants or plant parts, including, but not limited to leaves, stems, roots, 
reproductive, and seed, with a modified content of lipid and/or sterol esters and to alter the 
oil production by such plants. 

Of particular interest in the present invention, is the use of ACAT genes in 

2 0 conjunction with the LCAT sequences to increase the sterol content of seeds. Thus, 

overexpression of a nucleic acid sequence encoding an ACAT and LCAT in an oilseed 
crop may find use in the present invention to increase sterol levels in plant tissues and/or 
increase oil production. 

It is contemplated that the gene sequences may be synthesized, either completely or 
25 in part, especially where it is desirable to provide plant-preferred sequences. Thus, all or a . 
portion of the desired structural gene (that portion of the gene which encodes the LCAT or 
ACAT protein) may be synthesized using codons preferred by a selected host Host- 
preferred codons may be determined, for «ample, from the codons used most frequently 
in the proteins expressed in a desired host species. 

3 0 One skilled in the art will readily recognize that antibody preparations, nucleic acid 

probes (DNA arid RNA) and the like may be prepared and used to screen and recover 
"homologous" or "related" sequences from a variety of plant sources. Homologous 
sequences are found when there is an identity of sequence, which may be determined upon 
comparison of sequence information, nucleic acid or amino acid, or through hybridization 
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reactions between a known LCAT and a candidate source. Conservative changes, such as 
Glu/Asp, Val/Ile, Ser/Thr, Arg/Lys and Gln/Asn may also be considered in determining 
sequence homology. Amino acid sequences are considered homologous by as little as 
25% sequence identity between the two complete mature proteins. (See generally, 
5 Doolittle, OF URFS and (University Science Books, CA, 1986;) 

Thus, other LC ATs may be obtained from the specific sequences provided herein. 
Furthermore, it will be apparent that one can obtain natural and synthetic sequences, 
including modifieid amino acid sequences and starting materials for synthetic-protein 
modeling from the exemplified LCAT and ACAT sequences and from sequences which 

10 are obtained tlirough the use of such exemplified sequences. Modified amino acid 

sequences include sequences which have been mutated, truncated, increased and the like, 
whetlier such sequences were partially or wholly synthesized. Sequences which are 
actually purified from plant preparations or are identical or encode identical proteins 
thereto, regardless of the method used to obtain the protein or sequence, are equally 

15 considered naturally derived. ^ 

For immunological screening, antibodies to the protein can be prepared by 
injecting rabbits or mice with the purified protein or portion thereof, such methods of 
preparing antibodies being well known to those in the art. Either monoclonal or 
polyclonal antibodies can be produced, although typically polyclonal antibodies are more 

2 0 useful for gene isolation. West^ analysis may be conducted to determine that a related 

protein is present in a crude extract of the desired plant species, as determined by cross- 
reaction with the antibodies to the encoded proteins. When cross-reactivity is observed, 
genes encoding the related proteins are isolated by screening expression libraries 
representing the desired plant species. Expression libraries can be constructed in a variety 
25 . of commercially available vectors, including lambda gtll, as described in Sambrook, et ai 
{Molecular Cloning: A Laboratory Manml, Second Edition (1989) Cold Spring Harbor 
Laboratory, Cold Spring Haibor, New York). 

To confirm the activity and specificity of the proteins encoded by the identified 
nucleic acid sequences as acyltransferase enzymes, in vitro assays are performed in insect 

3 0 cell cultures using baculovirus expression systems. Such baculovirus expression systems 

are known in the art and are described by Lee, et al U.S. Patent Number 5,348,886, the 
entirety of which is herein incorporated by reference. 

In addition, other expression constructs may be prepared to assay for protein 
activity utilizing different expression systems. Such expression constructs are transformed 
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into yeast or prokaryotic host amd assayed for acyltransferase activity. Such expression 
systems are known in the art and are readily available through commercial sources. 

The method of transformation in obtaining such transgenic plants is not critical to 
the instant invention, and various methods of plant transformation are currently available. 
5 Furthermore, as newer methods become available to transform crops, they may also be 
directly applied hereunder. For example, many plant species naturally susceptible to 
Agrobacterium infection may be successfiilly transformed via tripartite or binary vector 
methods of Agrobacterium mediated transformation. In many instances, it will be 
desirable to have the construct bordered on one or both sides by T-DNA, particularly 

1 0 having the left and right borders, more particularly the right border. This is particularly 
useful when the construct uses A. tumefaciens or A. rhizogenes as a mode for 
transformation, although the T-DNA borders may find use with other modes of 
transformation. In addition, techniques of microinjection, DNA particle bombardment, 
and electroporation have been developed which allow for the transformation of various 

1 5 monocot and dicot plant species. 

Normally, included with the DNA construct will be a structural gene having the 
necessary regulatory regions for expression in a host and providing for selection of 
transformant cells. The gene may provide for resistance to a cytotoxic agent, e.g. 
antibiotic, heavy metal, toxin, etc., complementation providing prototrophy to an 

2 0 auxotrophic host, viral immunity or the like. Depending upon the number of different host 
species the expression construct or components thereof are introduced, one or more 
markers may be employed, where different conditions for selection are used for the 
different hosts. 

Non-limiting examples of suitable selection markers include genes that confer 
25 resistance to bleomycin, gentamycin, glyphosate, hygromycin, kanaraycin, methotrexate, 
phleomycin, phosphinotricin, spectinomycin, streptomycin, sulfonamide and 
sulfonylureas. Maliga et al.. Methods in Plant Molecular Biology^ Cold Spring Harbor 
Laboratory Press, 1995, p. 39. Examples of markers include, but are not limited to, 
alkaline phosphatase (AP), myc, hemagglutinin (HA), p glucuronidase (GUS), luciferase, 
30 and green fluorescent protein (GFP). 

Where Agrobacterium is used for plant cell transformation, a vector may be used 
which may be introduced into the Agrobacterium host for homologous recombination with 
T-DNA or the Ti- or Ri-plasmid present in the Agrobacterium host. The Ti- or Ri-plasmid 
contammg the T-DNA for recombination may be armed (capable of causing gall 
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formation) or disarmed (incapable of causing gall formation), the latter being permissible, 
so long as the v/r genes are present in the transformed Agrohacterium host. The armed 
plasmid can give a mixture of normal plant cells and gall. 

In some instances where i4gro6acrerztt/n is used as the vehicle for transforming 
host plant cells, the expression or transcription construct bordered by the T-DNA border 
region(s) will be inserted into a broad host range vector capable of replication in E. coli 
and Agrobacterium, there being broad host range vectors described in the literature. 
Commonly used is pRK2 or derivatives thereof. See, for example, Ditta, et al.^ (Proa Nat 
Acad. ScL U.SA. (1980) 77:7347-7351) and EPA 0 120 515, which are incorporated 
herein by reference, Alternatively, one may insert the sequences to be expressed in plant 
cells into a vector containing separate replication sequences, one of which stabilizes the 
vector in E, coli, and the other in Agrohacterium. See, for example, McBride and 
Summerfelt (Plant Mol Biol (1990) ;-/:269-276), wherein the pRiHRI (Jouanin, et al, . 
Mol Gen. Genet. (1985) 201:370-374) origin of replication is utilized and provides for 
added stability of the plant expression vectors in host Agrohacterium cells. - 

Included with die expression construct and the T-DNA can be one or more 
markers, which allow for selection of transformed Agrohacterium and transformed plant 
cells. A 

number of markers have been developed for use with plant cells, such as resistance to 
chloramphenicol, kanamycin, the aminoglycoside G41 8, hygromycin, or the like. The 
particular marker employed is not essential to this invention, one or another marker being 
preferred depending on the particular host and the manner of construction. 

For transformation of plant cells using Agrohacteriumy explants may be combined 
and incubated with the transformed Agrohacterium for sufficient time for transformation, 
the bacteria killed, and the plant cells cultured in an appropriate selective medium. Once 
callus forms, shoot formation can be encouraged by employing the appropriate plant 
hormones in accordance with known methods and the shoots transferred to rooting 
medium for regeneration of plants. The plants may then be grown to seed and the seed 
used to establish repetitive generations and for isolation of vegetable oils. 

Thus, in another aspect of the present invention, methods for modifying the sterol 
and/or stanol composition of a host cell. Of particular interest are methods for modifying 
the sterol and/or stanol composition of a host plant cell. In general the methods involve 
either increasing the levels of sterol ester compounds as a proportion of the total sterol 
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compounds* The method generally comprises the use of expression constructs to direct the 
expression of the polynucleotides of the present invention in a host cell. 

Also provided axe methods for reducing the proportion of sterol ester compounds 
as a percentage of total sterol compounds in a host plant cell The method generally 
5 comprises the use of expression constructs to direct the suppression of endogenous 
acyltransferase proteins in a host cell 

Of particular interest is the use of expression constructs to modify the levels of 
sterol compounds in a host plant cell Most particular, the methods find use in modifying 
the levels of sterol compounds in seed oils obtained &om plant seeds. 

1 0 Also of interest is the use of expression constructs of the present invention to alter 

oil production in a host cell and in particular to increase oil production. Of particular 
interest is the use of expression constructs containing nucleic acid sequences encoding 
LCAT and/or ACAT polypeptides to transform host plant cells and to use these host cells 
to regenerate whole plants having increase oil production as compared to the same plant 

1 5 not containing the expression construct. 

The oils obtained from transgenic plants having modified sterol compound content 
find use in a wide variety of applications. Of particular interest in the present invention is 
the use of the oils containing modified levels of sterol compounds m applications involved 
in improving human nutrition and cardiovascular health. For example^ phytostanols are 

20 beneficial for lowering serum cholesterol (Ling, et al (1995) Life Sciences 57:195-206). 

Cholesterol-low^g compositions comprise the oils and sterol ester compound 
compositions obtained using the methods of tlie present invention. Such cholesterol 
lowering compositions include, but are not limited to foods, food products, processed 
foods, food ingredients, food additive compositions, or dietary/nutritional supplements 

25 that contain oils and/or fats. Non-limiting examples include margarines; butters; 
shortenings; cooking oils; frying oils; dressmgs, such as salad dressings; spreads; 
mayonnaises; and vitamin/mineral supplements. Patent documents relating to such 
compositions include, US. Patents 4,588,717 and 5,244,887, and PCT International 
Publication Nos. WO 96/38047, WO 97/42830, WO 98/06405, and WO 98/06714. 

3 0 Additional non-limiting examples include toppings; dairy products such as cheese and 
processed cheese; pirocessed meat; pastas; sauces; cereals; desserts, including frozen and 
shelf-stable desserts; dips; chips; baked goods; pastries; cookies; snackbars; 
confections; chocolates; beverages; unextracted seed; and unextracted seed that has been 
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ground, cracked, milled, rolled, extruded, pelleted, defatted, dehydrated, or otherwise 
processed, but which still contains the oils, etc.» disclosed herein. 

The cholesterol-lowering compositions can also take the form of pharmaceutical 
compositions comprising a cholesterol-lowering effective amount of the oils or sterol 
5 compound compositions obtained using the methods of the present nivention, along with a 
pharmaceutically acceptable carrier, excipient, or diluent. These pharmaceutical 
compositions can be in the form of a liquid or a solid. Liquids can be solutions or 
suspensions; solids can be in the form of a powder, a granule, a pill, a tablet, a gel, or an 
extrudate. U.S, Patent 5,270,041 relates to sterol-containing pharmaceutical compositions. 

1 0 Thus, by expression of the nucleic acid sequences encoding acyltransferase-like 

sequences of the present invention in a host cell, it is possible to modify the lipid content 
and/or composition as well as the sterol content and/or composition of the host cell 

The invention now being generally described, it will be more readily understood by 
reference to the following examples which are included for purposes of illustration only 

15 and are not intended to limit the present invention. *i 

EXAMPLES 

Example 1: RNA Isolations 

Total RNA from the inflorescence and developing seeds of Arabidopsis thaliana 
2 0 was isolated for use in construction of complementary (cDNA) libraries. The procedure 
was an adaptation of the DNA isolation protocol of Webb and Rnapp (D.M. Webb and S.J. 
Rnapp, (1990) Plant Molec. Reporter, 8, 180-185). The following description assumes the 
use of Ig fresh weight of tissue. Frozen seed tissue was powdered by grinding under 
liquid nitrogen. The powder was added to 10ml REC buffer (50mM Tris-HCl, pH 9, 0.8M 

2 5 NaCl, 1 OmM EDTA, 0.5% w/v CTAB (cetyltrimethyl-ammonium bromide)) along with 

0.2g insoluble polyvinylpolypyrrolidone, and ground at room temperature. The 
homogenate was centrifuged for 5 minutes at 12,000 xg to pellet insoluble material. The 
resulting supernatant fraction was extracted with chloroform, and the top phase was 
recovered. 

3 0 The RNA was then precipitated by addition of 1 volume RecP (50mM Tris-HCX 

pH9, lOmM EDTA and 0.5% (w/v) CTAB) and collected by brief centrifiigation as before. 
The RNA pellet was redissolved m 0.4 ml of IM NaCl. The RNA pellet was redissolved 
in water and extracted with phenol/chloroform. Sufficient 3M potassium acetate (pH 5) 
ws added to make the mixture 0.3M in acetate, followed by addition of two volumes of 
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ethanol to precipitate the KNA. After washing with ethanol, this final RNA precipitate 
was dissolved in water and stored firozen. 

Alternatively, total RNA may be obtained using TRIzol reagent (BRL- 
Lifetechnologies, Gaithersburg, MD) following the mahufacturer's protocol. The RNA 
5 precipitate was dissolved in water and stored firozen. 

Example 2: Identification of LCAT Sequences 

Searches were performed on a Silicon Graphics Unix computer using additional 
Bioaccellerator hardware and Gen Web software supplied by Compugen Ltd, This 

10 software and hardware enabled the use of the Smith- Waterman algorithm in searching 
DNA and protein databases using profiles as queries. The program used to query protein 
databases was profilesearch. This is a search where the query is not a single sequence but 
a profile based on a multiple alignment of amino acid or nucleic acid sequences. The 
profile was used to query a sequence data set, i.e., a sequence database. The profile 

1 5 contained all the pertinent information for scoring each position in a sequence, in effect 
replacing the "scorinig matrix" used for the standard query searches. The program used to 
query nucleotide databases with a protein profile was tprofilesearch. Tprofilesearch 
searches nucleic acid databases using an ainino acid profile query. As the search is 
running, sequences in the database are translated to amino acid sequences in six reading 

2 0 firames. The output file for tprofilesearch is identical to the output file for profilesearch 
except for an additional column that indicates the frame in which the best alignment 
occurred. 

The Smith- Waterman algorithm, (Smith and Waterman (1981) */ Molec, Biol 
147:195-197), was used to search for similarities between one sequence from the query 
25 and a group of sequences contamed in the database. 

A protein sequence of Lecithin: cholesterol acyltransferase from human (McLean 
J, et ai (1986) Nucleic Acids Res. 14(23):9397-406 SEQ ID N0:1)) was used to search the 
NCBI non-redundant protein database using BLAST. Three sequences were identified 
firom Arabidopsis^ GenBank accessions AC004557 ( referred to herein as LCATl, SEQ ID 
30 N0:2), AC003027 (referred to herem as LCAT2, SEQ ID N0:4), and AL024486 (referred- 
to herein as LCAT3, SEQ ID N0:6). The deduced amino acid sequences are provided in 
SEQ ID NOs: 3, 5, and 7, respectively. 

The profile generated from the queries using PSI-BLAST was excised firom the 
hyper text markup language (html) file. The worldwide web (www)/html interface to 
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psiblast at ncbi stores the current generated profile matrix in a hidden field in the html file 
that is returned after each iteration of psiblast. However, this matrix has been encoded into 
string62 (s62) fonnat for ease of transport through html String62 format is a simple 
conversion of the values of the matrix into html legal ascii characters. 
5 The encoded matrix width (x axis>is 26 characters, and comprise the consensus 

characters, the probabilities of each amino acid in the order AJ3>CJD,E,F,GJI,I,K,L,M,N, 
P,Q,R,S,T,V,W,X,Y,Z (where B represents D and N, and Z represents Q and E, and X 
represents any amino acid), gap creation value, and gap extension value. 

The length (y axis) of the matrix corresponds to the length of the sequences 
10 identified by PSI-BLAST. The order of the amino acids corresponds to the conserved 
amino acid sequence of the sequences identified using PSI-BLAST, with the N-terminal 
end at the top of the matrix. The probabilities of other amino acids at that position are 
represented for each amino acid along the x axis, below the respective single letter amino 
acid abbreviation. 

1 5 Thus, each row of the profile consists of the highest scoring (consensus) amino 

acid, followed by the scores for each possible amino acid at that position in sequence 
matrix, the score for opening a gap that that position, and the score for continuing a gap at 
that position. 

The string62 file is converted back into a profile for use ui subsequent searches. 
2 0 The gap open field is set to 1 1 and the g^ extension field is set to 1 along the x axis. The 
gap creation and gap extension values are known, based on the settings given to the PSI- 
BLAST algorithm. The matrix is exported to the standard GCG profile form. This fonnat 
can be read by Gen Web. 

The algorithm used to convert the string62 formatted file to the matrix is outlined 
25 in Table 1. 
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Table 1 



10 



1 . if encoded character z then the value is blast score min 

2. if encoded character Z then the value is blast score niax 

3. else if the encoded character is uppercase then its value is (64-(ascii # of char)) 

4. else if the encoded character is a digit the value is ((ascii # of char)-48) 

5. else if the encoded character is not uppercase then the value is ((ascii # of char) - 87> 

6. ALL B positions are set to min of D and N amino acids at that row in sequence matrix 

7. ALL Z positions are set to min of Q amd E amino acids at that row in sequence matrix 

8. ALL X positions are set to min of all amino acids at that row in sequence matrix 

9. kBLAST_SCORE_MAX=999; 

10. kBLAST^SCORE_MIN==-999; 
U. all gap opens are setto 11 

12. all gap lens are set to 1 
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The protein sequences of LCATl, LCAT2, and LCAT3 as well as the PSI-BLAST 
profile were used to search public and proprietary databases for additional LCAT 
sequences. Two EST sequences were identified which appear to be identical to LCATl 
and LCAT3, respectively. One additional Arabidopsis sequence was identified firom the 
proprietary databases, LCAT4 (SEQ ID NO:8). The deduced protein sequence of LCAT4 
is provided in SEQ ID N0:9. Two additional genomic sequences were identified using the 
PSI-BLAST profile from libraries of Arabidopsis ecotypes Columbia and Landsberg, 
LCAT7(SEQIDNO:10)andLCAT8 (SEQ IDNOrll). The LCAT7 sequence was 
present in both the Columbia and Landsberg genomic libraries, while the LC AT8 sequence 
was only present in the Columbia library. 

An open reading frame was predicted from the genomic sequence of LCAT7 in the 
Arabidopsis public database and this sequence was called MSH12 (referred to herein as 
LCAT5, SEQ ID NO: 73), The deduced protem sequence of LCAT5 is provided in SEQ 
ID NO: 74. 

The PSI-BLAST profile and the LCAT sequences were used to query the public 
yeast database and proprietary libraries containing com and soy EST sequences. The yeast 
genome contains only one gene, LROl (LCAT Related Open reading frame, YNR008W, 
Figure 1) with distinct similarity to the human LCAT. The DNA sequence of LROl is 
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provided in SEQ ID NO: 75 and the protein sequence is provided in SEQ ED NO: 76. 
Seven EST sequences were ideaitified firom soybean libraries as being LCAT sequences. 
Two sequences from soy (SEQ ID NOs: 12 and 13) are most closely related to the 
Arahidopsis LCATl sequence, a single sequence was identified as being most closely 
5 related to LCAT2 (SEQ ID N0:14) , three were closely related to LCAT3 (SEQ ID NOs: 
1 5-17), and an additional single sequaice was identified (SEQ ID NO: 1 8). A total of 1 1 
com EST sequences were identified as being related to (lasArabidopsis LCAT sequences. 
Two com EST sequences (SEQ ID NOs: 19 and 20) were most closely related to LCATl, 
two sequences were identified as closely related to LCAT2 (SEQ ID NOs: 21 and 22), four 
10 com EST sequences were identified as closely related to LCAT3 (SEQ ID NOs: 23-26), 
and an additional three com EST sequences were also identified (SEQ ID NOs: 27-29). 

Example 3: Identification of AC AT Sequences 

Since plant ACATs are unknown in the art,, searches were performed to identify 
1 5 known and related ACAT sequences from mammalian sources from public databases. 

These sequences were then used to search public and proprietary EST databases to identify 
plant ACAT-like sequences. 

A public database containing mouse Expressed Sequence Tag (EST) sequences 
(dBEST) was searched for ACAT-like sequences. The search identified two sequences 
2 0 (SEQ ID 30 and 3 1) which ware related (approximately 20% identical), but divergent, to 
known ACAT sequences. 

In order to identify ACAT-like sequences from other organisms, the two mouse 
ACAT sequaices were used to search public and proprietary databases containing EST 
sequences from hunian and rat tissues. Results of the search identified several sequences 

2 5 from the human database and from the rat database which were closely related to the 

mouse sequences. The human and rat ACAT-like EST sequences were assembled, using 
the GCG assembly program, to construct a complete inferred cDNA sequence by 
identifying overlapping sequences (SEQ ID NOs: 32 and 33, respectively). 

The protem sequence of the human ACAT-like sequence was aligned with known 

3 0 ACAT sequences from human (Chang, et at (1993) /. Biol. Chem. 268:20747-20755, 

SEQ ID NO:34), mouse (Uelmen, et al (1995) J. Biol. Chem. 270:26192-26201 SEQ ID 
NO:35) and yeast (Yu, et al (1996) /. Biol. Chem. 271 :24157-24163, SEQ ID NO:36 and 
Yang, et al (1996) Science 272:1353-1356, SEQ ID NO:37) using MacVector (Oxford 
Molecular, Inc.). Results of the alignment demonstrated that the sequence was related to 
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the known sequences, however the related sequence was only about 25% similar to the 
known sequences. 

The protein sequence of the human sterol 0-acyltransferase (ACAT, Acyl 
CoA:Cholesterol acyltransferase. Accession number A48026) related sequence was used 
5 to search protein and nucleic acid Genbank databases. A single plant homologue was 
identified in the public Arabidopsis EST database (Accession A042298, SEQ ID NO:38). 
The protein sequence (SEQ ID NO:39)was translated from the EST sequence, and was 
found to contain a peptide sequence conserved in both mammalian and yeast ACATs 
(Chang et al, (1997) Ann. Rev. Biochem., 66:613-638). 

10 To obtain the entire coding region corresponding to the Arabidopsis ACAT-like 

EST, synthetic oligo-nucleotide primers were designed to amplify the 5' and 3' ends of . 
partial cDNA clones containing ACAT-like sequences. Primers were designed according 
to the Arabidopsis ACAT-like EST sequence and were used in Rapid Amplification of 
cDNA Ends (RACE) reactions (Frohman «^ al (1988) Proc. Natl. Acad. Sci. USA 

15 85:8998-90(S). 

Primers were designed (5'-TGCAAATTGACGAGCACACCAACCCCTTC-3' 
(SEQ ID NO:40) and 5'.AAGGATGCTTTGAGTTCCTGACAATAGG-3* (SEQ ID 
NO:41)) to amplify the 5' end from the Arabidopsis ACAT EST sequence. Amplification 
of flanking sequences from cDNA clones were performed usmg the Marathon cDNA 

20 Amplification kit (Clontech, CA), 

The sequence derived from the 5*-RACE amplification was used to search 
proprietary Arabidopsis EST libraries. A smgle EST accession, LIB25-088-C7 (SEQ ID J 
NO:42), was identified which contained a sequence identical to the 5 '-RACE sequence. 
Furthermore, LIB25-088-C7 was found to contain the complete putative coding sequence 

25 for the ^roAfrfopjr/^ ACAT-like product 

The nucleic acid as well as the putative translation product sequences of A042298 
were used to search public and proprietary databases. Four EST sequences were identified 
in both soybean (SEQ ID NOs:43-46) and maize (SEQ ID NOs:47-50) proprietary 
databases, and a single ACAT-like sequence was identified &om Mortierrella alpina EST 

3 0 sequences (SEQ ID N0:5 1). 

Sequence alignments between ACAT sequences fix)m several different sources 
were compared to identify the similarity between the sequences. Nucleotide sequences 
from known human and mouse ACATs, as well as nucleotide sequences from iaiown yeast 
ACATs were compared to the ACAT-like EST sequences fix>m human and Arabidopsis. 
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Analysis of the sequence alignments revealed several classes of ACATs based on 
sequence similarity. The known human and mouse ACATs, being 88% similar in the 
nucleotide sequence, formed one class of ACATs. Another class of ACATs included the 
yeast ACATs which are less than 20% similar to the known human and mouse class 
ACATs. 

The final class of ACATs included the Arabidopsis and human sequences disclosed 
in the present invention. This class is approximately 22% similar to the known human and 
mouse ACAT class and approximately 23% similar to the yeast class of ACATs. Thus, 
the ACAT sequences disclosed in the present invention represent a novel class of ACAT 
enzymes. Partial mouse sequences of this class are also provided. 

Example 4: Expression Construct Preparation 

Constructs were prepared to direct expression of the LCATl, LCAT2, LCAT3, 
LCAT4, LCAT5 and the yeast LROl sequences in plants and cultured insect cells. The 
entire coding region of each LCAT was amplified from the appropriate EST clone or an 
Arabidopsis genomic cDNA library using the following oligonucleotide primers in a 
polymerase chain reactions (PGR). The LCATl codmg sequence was amplified fi-om the 
EST clone Lib25-082-Ql-El-G4 using the primers 

5'-GGATCCGCGGCCGCACAATGAAAAAAATATCTTCACATTATTCGG-3* (SEQ 
ID NO:52) and 5'.GGATCCCCTGCAGGTCATTCATTGACGGCATTAACATTGG-3' 
(SEQ ID NO:53). The LC AT2 coding sequence was amplified from an Arabidopsis 
genomic cDNA library using the synthetic oligo nucleotide primers 
5'-GGATCCGCGGCCGCACAATGGGAGCGAATTCGAAATCAGTAACG-3' (SEQ 
ID NO:54) and 5^GGATCCCCTGCAGGTTAATACCCACTTTTATCAAGCTCCC-3' 
(SEQ ID NO:55), The LCAT3 coding sequence was amplified from the EST clone . 
LIB22-004-Q1-E1-B4 using the synthetic oligo nucleotide primers 
5'-GGATCCGCGGCCGCACAATGTCTCTATTACTGGAA GAGATC-3' (SEQ ID 
NO:56) and S'-GGATCCCCTGCAGGTTATGCATC AACAGAGACACTTACAGC-3^ 
(SEQ ED NO:57) . The LCAT4 coding sequence was amplified from the EST clone 
LIB23-007-Q1-E1-B5 using the synthetic oligo nucleotide primers 
5'-GGATCC(jCGGCCGCACAATGGGCTGGATTCCGTGTCCGTGC-3' (SEQ ID 
NO:58)and 5'-GGATCCCCTGCAGGTTAACCAGAATCAACTACTTTGTG-3' (SEQ 
ID NO:59). The LCAT5 coding sequence was amplified fix>m LIB23-053-Q1-E1-E3 using 
the synthetic oligo nucleotide primers 
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5'-GGATCeGCGGCCGCACAATGCCCCTTATTCATCGG-3* (SEQ ID NO:77) and 5^- 
GGATCCCCTGCAGGTCACAGCTTCAGGTCAATACG-3' (SEQ ID NO:78). 

The yeast LROI coding sequence was araplified from genomic yeast DNA using 
the synthetic oUgo nucleotide primers 
5 5'GGATCCGCGGCCGCACAATGGGCACACTGTTTCGAAG3' (SEQ ID NO:79) 
and 5'GGATCCCCTGCAGGTTACATTGGGAAGGGCATCTGAG3' (SEQ ID NO: 80). 

The entire coding region of iheArabidopsis ACAT sequence (SEQ ID NO: 42) 
was amplified from the EST clone LIB25-088-C7 using oligonucleotide primers 
5^TCGACCTGCAGGAAGeTTAGAAATGGCGATTTTGGATTC-3' (SEQ ID NO: 60) 
10 and 5'<GGATCCGCGGCCGCTCATGACATCGATCCTTTTCGG-3' (SEQ ID NO: 61) 
in a polymerase chain reaction (PGR). 

Each resulting PGR product was subcloned into pCR2. 1 Topo (Invitrogen) and 
labeled pCGN9964 (LCATl), pCGN9985 (LCAT2), pCGN9965 (LCAT3), pCGN9995 
(LCAT4), pCGN10964 (LCAT5). pCGN10963 (LROll and pCGN8626 (ACAT). 
1 5 Double stranded DNA sequence was obtained to verify that no errors were introduced by 
the PGR amplification. 



4A. Baculovirus Expression Constructs 

Constructs are prepared to direct the expression of the Arabidopsis LCAT and 
2 0 yeast LCAT sequences in cultured insect cells* The entire coding region of the LCAT 

proteins was removed &om the respective constructs by digestion with Notl and &e8387I, 
followed by gel electrophoresis and gel purification* The fragments containing the LCAT 
coding sequences were cloned into Notl and Pstl digested baculovirus expression vector 
pFastBacl (Gibco-BRL, Gaithersburg, MD), Hie resulting baculovirus expression 

2 5 constructs were referred to as pCGN9992 (LCATl), pCGN9993 (LCAT2), pCGN9994 

(LCAT3), pCGN10900 (LCAT4), pCGN10967 (LCAT5), and pCGN10962 (JLROl). 

4B. Plant Expression Construct Preparation 

A plasmid containing the napin cassette derived from pCGN3223 (described in 

3 0 U.S. Patent No. 5,639,790, the entirety of which is incorporated herein by reference) was 

modified to make it more usefiil for cloning large DNA fragments containing multiple 
restriction sites, and to allow the cloning of multiple napin fi;ision genes into plant binary 
transformation vectors. An adapter comprised of the self annealed oligonucleotide of 
sequence 5'- 
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CGCGATTTAAAIGGCGCGCCCTGCAGGCGGCCGCCTGCAGGGCGCGCCATTTA 
AAT-3' (SEQ ID NO:62) was ligated into the cloning vector pBC SK+ (Stratagene) after 
digestion with the restriction endonuclease BssHII to construct vector pCGN7765. 
Plamids pCGN3223 and pCGN7765 were digested with NotI and ligated together. The 
5 resultant vector, pCGN7770, contained the pCGN7765 backbone with the napin seed 
specific expression cassette fixjra pCGN3223. 

The cloning cassette. pCGN7787, contained essentially the same regulatory 
elements as pCGN7770, with the exception of the napin regulatory regions of pCGN7770 
have been replaced with the double CAMV 35S promoter and the tnU polyadenylation and 

10 transcriptional termination region. 

A binary vector for plant transformation, pCGN5139, was constmcted from 
pCGN1558 (McBride and Summerfelt, (1990) Plant Molecular Biology, 14:269-276). In 
pCGN5139, the polylinker of pCGN1558 was replaced as a HindIII/Asp71 8 fragment with 
a polylinker contsdning unique restriction endonuclease sites, AscI, Pad, Xbal, Swal, 

15 BamHI,and Notl. The Asp718 and Hindin restriction endonuclease sites are retained in 
pCGN5139. 

A series of turbo binary vectors was constructed to allow for the r^id cloning of 
DNA sequences into binary vectors containing transcriptional initiation r^ons .;. 
(promoters) and transcriptional termination regions. 

20 The plasmid pCGN861 8 was coiKtructed by ligating oligonucleotides 

5'-TCGAGGATCCGCGGCCGCAAGCTTCCTGCAGG-3' (SEQ ID NO:63) and 
5*-TCGACCTGCAGGAAGCTTGCGGCCGCGGATCC-3' (SEQ ID NO:64) into 
Sall/XhoI-digested pCGN7770. A fragment containing the napin promoter, polylinker and 
napin 3' region was excised from pCGN8618 by digestion with Asp718I; the fragment 

25 was blunt-ended by filling in the 5' overhangs with Klenow fragment then ligated into 
pCGN5139 that had been digested with Asp718I and Hindm and blunt-ended by filling in 
the 5 ' overhangs with Klenow fragment. A plasmid containing the insert oriented so that 
the napin promoter was closest to the blunted Asp718I site of pCGN5 139 and the napin 3' 
was closest to the bhmted Hindlll site was subjected to sequ«ice analysis to confirm both 

30 the insert orientation and the integrity of clpning junctions. The resulting plasmid was 
designated pCGN8622. 

The plasmid pCGN8619 was constructed by ligating oligonucleotides 
5*-TCGACCTGCAGGAAGCTTGCGGCCGCGGATCC-3' (SEQ ID NO:65) and 
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5*-TCGAGGATCCGCGGCCGCAAGCTTCCTGCAGG-3* (SEQIDNO:66) into 
SaU/XhoI-digested pCGN7770. A fragment containing the napin promoter, polylinker and 
napin 3' region was removed from pCGN8619 by digestion with Asp718I; the fragment 
was blunt-ended by filling in the 5* overhangs with Klenow fragment then ligated into 
5 pCGN5 139 that had been digested with Asp718I and ffindin and blunt-ended by filling in 
the 5' overhangs with Klenow fragment. A plasmid containing the insert oriented so that 
the napin promoter was closest to the blunted Asp718I site of pCGN5139 and the napin 3' 
was closest to the blunted HindEH site was subjected to sequence analysis to confirm both 
the insert orientation and the integrity of cloning junctions. The resulting plasmid was 

1 0 designated pCGN8623. 

The plasmid pCGN8620 was constructed by hgating oligonucleotides 
5 '-TCGAGGATCCGCGGCCGCAAGCTTCCTGCAGGAGCT -3' (SEQ ID NO:67) and 
5^CCTGCAGGAAGCTTGCGGCCGCGGATCC-S* (SEQ ID NO:68) into Sall/Sacl- 
digested pCGN7787, A fragment containing the d35S promoter, polylinker and tml 3' 

1 5 region was removed from pCGN8620 by complete digestion with Asp71 81 and partial 
digestion withNotl. The fragment was blunt-ended by filling in tfie 5* overhangs with 
Klenow fragment then ligated into pCGN5139 that had been digested with Asp718I and 
Hindm and blunt-ended by filling in the 5* ovrahangs with Klenow fragment. A plasmid 
containing the insert oriented so that the d3SS promoter was closest to the blunted 

2 0 Asp718I site of pCGN5139 and the tml 3' was closest to the blunted Hindm site was 

subjected to sequence analysis to confirm both the insert orientation and the integrity of 
cloning junctions. The resulting plasmid was designated pCGN8624. 

The plasmid pCGN8621 was constructed by ligating oligonucleotides 
5'-TCGACCTGCAGGAAGCTTGCGGCCGCGGATCCAGCT -3* (SEQ ID NO:69) and 
25 5'-GGATCCGCGGCCGCAAGCTTCCTGCAGG-3'(SEQIDNO:70) intoSall/SacI- 
digested pCGN7787. A fragmrat containing the d35S promoter, polylinker and tml 3* 
region was removed from pCGN8621 by complete digestion with Asp718I and partial 
digestion with Notl. The fragment was blunt-ended by filling m the 5' overhangs with 
Klenow fragment then ligated into pCGN5139 that had bem digested with Asp718I and 

3 0 Hindlll and blunt-ended by filling in the S' overhangs with EJenow fiagment. A plasmid 

containing the insert oriented so that the d35S promoter was closest to the blunted 
Asp718I site of pCGN5139 and the tml 3' was closest to the blunted HindlU site was 
subjected to sequence analysis to confirm both the insert orientation and the integrity of 
cloning junctions. The resulting plasmid was designated pCGN8625. 
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The plasmid construct pCGN8640 is a modification of pCGN8624 described 
above. A 938bp PstI fragment isolated from transposon Tn7 which encodes bacterial 
spectinoraycin and streptomycin resistance (Fling et al. (1985), Nucleic Acids Research 
13(19):7095-7106), a detenninant for E. coll and Agrobacterium selection, was blunt 
5 ended with Pfu polymerase. The blunt ended fragment was ligated into pCGN8624 that 
had been digested with Spel and blunt ended with Pfu polymerase. The region containing 
the PstI fragment was sequenced to confirm both the insert orientation and the integrity of 
cloning junctions. 

The spectinomycin resistance marker was introduced into pCGN8622 and 
1 0 pCGN8623 as follows. A 7.7 Kbp AvrH-SnaBI fragment from pCGN8640 was ligated to 
a 10.9 Kbp Avrll-SnaBI fragment from pCGN8623 or pCGN8622, described above. The 
resulting plasmids were pCGN8641 and pCGN8643, respectively. 

The plasmid pCGN8644 was constructed by ligating oligonucleotides 
5'-GATCACCTGCAGGAAGCTTGCGGCCGCGGATCCAATGCA-3' (SEQIDN0:71) 
IS and 5'.TTGGATCCGCGGCCGCAAGCTTCCTGCAGGT-3' (SEQ ID NO:72) into 
BamHI-PstI digested pCGN8640. 

4C. Plant LCAT Expression Construct Preparation 

The coding sequence of LCATl was cloned from pCGN9964 as a NotV &e8387I 
2 0 fragment into pCGN8640, pCGN8641, pCGN8643, and pCGN8644 to create the 

expression constracts pCGN9960, pCGN9961, pCGN9962, and pCGN9963, respectively. 
The construct pCGN9960 was designed to express the LCATl codmg sequence in the 
sense orientation from the constitutive promoter CaMV 35S. The construct pCGN9961 
was designed to express the LCATl coding sequence in the antisense orientation from the 

2 5 napin promoter. The construct pCGN9962 was designed to express the LCATl coding 

sequence in the sense orientation from the napin promoter. The construct pCGN9963 was 
designed to express the LCATl coding sequence in the antisense orientation from the 
constitutive promoter CaMV 35S. 

The coding sequence of LCAT2 was cloned from pCGN998i5 as uNoiV &e8387I 

3 0 fragment into pCGN8640, pCGN8641, pCGN8643, and pCGN8644 to create the 

expression constructs pCGN9981, pCGN9982, pCGN9983, and pCGN9984, respectively. 
The construct pCGN9981 was designed to express the LCAT2 coding sequence in the 
sense orientation from the constitutive promoter CaMV 35S. The construct pCGN9982 
was designed to express the LCAT2 coding sequence in flie antisense orientation from the 
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napin promoter. The construct pCGN9983 was designed to express the LCAT2 coding 
sequence in the sense orientation from the napin promoter The construct pCGN9984 was 
designed to express the LCAT2 coding sequence in the antisense orientation from the 
constitutive promoter CaMV 35S. 
5 The coding sequence of LCAT3 was cloned from pCGN9965 as a NotV &c8387I 

fragment into pCGN8640, pCGN8641, pCGN8643, and pCGN8644 to create the 
expression constructs pCGN9966, pCGN9967, pCGN9968, and pCGN9969, respectively. 
The construct pCGN9966 was designed to express the LCAT3 coding sequence in the 
sense orientation from the constitutive promoter CaMV 35S. The construct pCGN9967 

10 was designed to express the LCAT3 coding sequence in the antisense orientation from the 
napin promoter. The construct pCGN9968 was designed to express the LCAT3 coding 
sequence in the sense orientation from the napin promoter. The construct pCGN9969 was 
designed to express the LCAT3 coding sequence in the antisense orientation fix^m the 
constitutive promoter CaMV 35S. 

15 The coding sequence of LCAT4 was cloned from pCGN9995 as zNotV &e8387I 

fragment into pCGN8640, pCGN8641, pCGN8643. and pCGN8644 to create the 
expression constructs pCGN9996, pCGN9997, pCGN9998, and pCGN9999, respectively. 
The construct pCGN9996 was designed to express the LCAT4 coding sequence in the 
sense orientation from the constitutive promoter CaMV 35S- The construct pCGN9997 

2 0 was designed to express the LCAT4 coding sequence in the antisense orientation from the . 

napin promoter. The construct pCGN9998 was designed to express the LCAT4 coding 
sequence in the sense orientation from the napin promoter. The construct pCGN9999 was 
designed to express the LCAT4 coding sequence in the antisense orientation from the 
constitutive promoter CaMV 35S. 
25 The coding sequence of LCAT5 was cloned from pCGN10964 as a NotV &e8387I 

fragment into pCGN9977 and pCGN9979, to create the expression constructs 
pCGN10965, and pCGN10966, respectively. The construct pCGN10965 was designed to 
express the LCAT5 coding sequence in the sense orientation from the constitutive 
promoter CaMV 35S. The construct pCGN10966 was designed to express the LCAT5 

3 0 coding sequence in the sense orientation from the napin promoter. 

The coding sequence of LROl was cloned from pCGN10963 as a Noil/ &e8387I 
fragment into pCGN9977 and pCGN9979, to create the expression constructs 
pCGN10960, and pCGN10961 , respectively. The construct pCGN10960 was designed to 
express the LROJ coding sequence in the sense orientation from the constitutive promoter 
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CaMV 35S. The construct pCGN10961 was designed to express the LROl coding 
sequence in the sense orientation from the napin promoter. 

4D. Plant ACAT Expression Construct Preparation 
5 A fragment containing the Arabidopsis ACAT-like coding region was removed 

from pCGN8626 by digestion with Sse8387I and Not L The fragment containing the 
ACAT-like sequence was ligated into Pstl-Not I digested pCGN8622. The resulting 
plasmid was designated pCGN8627. DNA sequence analysis confirmed the integrity of 
the cloning junctions. 

10 A fragment containing the Arabidopsis ACAT-like coding region (SEQ ID NO: 

42) was removed from pCGN8626 by digestion with Sse8387I and Not I The fragment 
was ligated into Pstl-Not I digested pCGN8623, The resulting plasmid was designated 
pCGN8628. DNA sequence analysis confirmed the integrity of the cloning junctions. 
A fragment containing the Arabidopsis ACAT-like coding region was removed 

15 from pCGN8626 by digestion with Sse8387 and Not L The fragment was ligated into 
Pstl-Not I digested pCGN8624. The resultmg plasmid was designated pCGN&629. DNA 
sequence analysis confirmed the integrity of the cloning junctions. 

A firagment containing the Arabidopsis ACAT-like coding region was removed 
fix>m pCGN8626 by digestion with Sse8387 and Not L The fragment was ligated into 

2 0 Pstl-Not I digested pCGN8625. The resulting plasmid was designated pCGN8630. DNA 
-sequence analysis confirmed the integrity of the cloning junctions. 

An additional expression construct for the suppression of endogenous ACAT-like 
activity was also prepared. The construct pCGN8660 was constructed by cloning 
approximately 1 Kb of the Arabidopsis ACAT-like coding region from pCGN8626 in the 

2 5 sense orientation, and the fiill-length Arabidopsis ACAT-like coding region in the 
antisense orientation under the regulatory control of the napin transcription initiation 
sequence. 

For expression of the rat ACAT-like sequence in plants, the NotI-Sse8387I 
fragment of pCGN8592 was cloned mto Notl-PstI digested binary vectors pCGN8621, 
30 pCGN8622, and pCGN8624 to yield plasmids, pCGN 9700, pCGN9701, and pCGN9702, 
respectively* Plasmid pCGN9700 expresses a sense transcript of the rat ACAT-like cDNA 
under control of a napin promoter^ plasmid pCGN9701 expresses an antisense transcript of 
the rat ACAT-like cDNA under control of a napin promoter, and plasmid pCGN9702 
expresses a sense transcript of the rat ACAT-like cDNA under control of a double 3SS 



S0CX:iD;<WO 01ieaORA2 I > 




wo 01/16308 PCT/USOO/23863 

43 

promoter. Plasmids pCGN 9700, pCGN9701, and pCGN9702 were introduced in 
Agrobacterium tumefaciens EHAl 01. 

Constructs were prepared to direct the expression of the rat AC AT-like sequence in 
the seed embryo of soybean and the endosperm of com. For expression of the rat ACAT- 
5 like DNA sequence in soybean, a 1 .5 kb NotVSseS3Sll fragment from pCGN8592 
containing the coding sequence of the rat ACAT-like sequence was blunt ended using 
Mung bean nuclease, and ligated into the Smal site of the turbo 7S binary/cloning vector 
pCGN8809 to create the vector pCGN8817 for transformation into soybean by particle 
bombardment. The vector pCGN8817 contained the operably linked components of the 

1 0 promoter region of the soybean a* subimit of p-conglycinin (7S promoter, (Chen et aL, 

(1986), Proa Natl Acad, ScL, 83:8560-8564), the DNA sequence coding for the entire rat ) 
ACAT-like protein, and the transcriptional termination region of pea RuBisCo small 
subunit, refeired to as E9 3' (Coruzzi, et al (1984) EMBOJ, 3:1671-1679 and Morelli, et 
ai (1 985) Nature 31 5:200-204). This construct further contained sequences for the 

1 5 selection of positive transformed plants by screening for resistance to glyphosate using the 
CP4 EPSPS (U.S. Patent 5,633,435) expressed under the control of the figwort mosaic 
virus (FMV) promoter (U.S. Patent Number 5,378,619) and the transcriptional termination 
region of E9. 

For expression of the rat ACAT-like sequence in the com endosperm, a 1 .5 kb 
20 A^£>rI/&e8387I fragment from pCGN8592 containing the coding sequence of t^^ 

ACAT-like sequence was blunt ended using Mung bean nuclease, and ligated into the 
BatnHl site of the rice pGtl expression cassette pCGN8592 for expression from the pGtl . ) 
promoter (Leisy, D J, et al.. Plant Mol Biol. 14 (1989) 41-50) and the HSP70 intron 
sequence (U.S. Patent Number 5,593,874). This cassette also included the transcriptional 

2 5 termination region downstream of the cloning site of nopaline synthase, nos 3 ' (Depicker 

et al., J. Molec, AppL Genet, (1982) 1 : 562-573). A 7.5 kb fragment containing the pGtl 
promoter, the DNA sequence encoding the rat ACAT-like protein, and the nos 
transcriptional termination sequence was cloned into the binary vector pCGN8S16 to 
create the vector pCGN8818 for transformation into com. This constmct also contained . 

3 0 sequences for the selection of positive transformants with kanamycin using the kanamycin 

. resistance gene from Tn5 bacteria under the control of the CAMV 35S promoter and tml 
transcriptional termination regions. 
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Example 5: Expression in Insect Cell Culture 

A baculovirus expression system was used to express the LCAT cDNAs in 
cultured insect cells. 

The baculovirus expression constructs pCGN9992, pCGN9993, pCGN9994, 
5 pCGNl 0900, pCGNl 0962, and pCGNl 0967 were transformed and expressed using the 
BAC-to-BAC Baculovirus Expression System (Gibco-BRL, Gaithersburg, MD) according 
to the manufacturer's directions. 

The transformed insect cells were used to assay for acyltransferase activities using 
methods known in the art (see Example 8). 

10 

Example 6: Plant Transformation 

A variety of methods have been developed to insert a DNA sequence of interest into 
the genome of a plant host to obtain the transcription or transcription and translation of the 
sequence to effect phenotypic changes. Transgenic plants were obtained by Agrobacterium" 
15 mediated transformation as described by Radke et al (TheoK AppL GeneL (198&) 75:685-694; 
Plant Cell Reports (1992) 77:499-505), Alternatively, microprojectile bombardment methods, 
such as described by Klein et al {Bio/Technology 70:286-291) may also be used to obtain 
nuclear transformed plants. Other plant species may be similarly transformed using related 
techniques. 

2 0 The plant binary constructs described above were used in plant transformation to direct 

the expression of the sterol acyltransferases in plant tissues. Binary vector constructs were 
transformed into strain EHAlOl Agrobacterium cells (Hood et aLy J. Bacteriol (1986) 
7(55:1291-1301), by the method of Holsters et aL (MoL Gen, GeneL (1978) 755:181-187). 
Tronsgtmc Arabidopsis thaliana plants were obtained )dY Agrobacterium^mtdxdXtd, 

25 transformation as described by Valverkens et aL, (Proc. Nat Acad ScL (1988) 55:5536-5540), 
Bent et al. ((1994), Science 265:1856-1860), and Bechtold et al. ((1993), C. R.Acad: Set, Life 
Sciences 316:1194-1199). 

Example 7: Plant Assays for Modified Sterol Content/Profile 
30 7A: NMRofT2seed 

Seed from plants expressing LCAT 1 through 4 under the control of the napin 
promoter were analyzed by NMR. Arabidopsis seeds from transgenic plants were placed 
directly into wide-mouth MAS NMR sample tubes. 
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High-resolution spectra were measured at 1 1.7 T (1H=500 MHz, 13C=125 mHz) 
using Varian NMR Instruments (Palo Alto, CA) Inova^^ NMR spectrometers equipped 
with carbon-observe MAS Nanoprobes™, The 13C spectra were acquired without a field- . 
frequency lock at ambient temperature (approx. 21-22**C) for 14 hours using the following 
5 conditions: spectral width - 29,996 kHz. acquisition time = 2.185 seconds, p/2 pulse (3.8 
ms) with no relaxation delay, IH g B2 = 2.5 kHz with Waltz decoupling. Data processing 
conditions were typically: digital resolution = 0.1 1 Hz, 03 to 1.5 Hz line broadening and 
time-reversed linear prediction of the first three data points. Chemical shifts were 
referenced by adding neat tetramethylsilane (TMS) to Arabidopsis seeds and using the ^ 

10 resulting referencing parameters for subsequent spectra. The 13C resolution was 2-3 Hz 
for the most narrow seed resonances. Spectral resolution was independent of MAS 
spinning speeds (1 .5-3.5 kHz) and data were typically obtained with 1 .5 kHz spinning 
speeds. Spinning sidebands were approx. 1% of the main resonance. Phytosterol 13C 
assignments were based on model samples composed of triolein^ p-sitosterol and 

1 5 cholesterol bleate. Triacylglycerol 13C assignments were made from comparison with 
literature assignments or with shifts computed from a 13C NMR database (Advanced 
Chemical Development, Inc., version 3.50, Toronto Canada). 

The results of these analyses are displayed in Figure 2 and show that there was a 
trend of an approximately 2 fold increase of phytosterols in the seeds derived from plant 

20 line 5 expressing the LC AT 4 gene (pCGN9998) under the control of the napin promoter. 
During the course of this analysis it was also noted that the average oil content of seed 
from plants expressing the LCAT2 construct (pCGN9983) under the control of the napin 
promoter was higher than that of controls. This is the first in planta evidence supporting 
the concept that overexpression of a nucleotide sequence encoding a lecithin:cholesterol 

25 acyltransferase-like polypeptide can mcriease oil content. 

7B:HPLC/MSofT2seed 

Seed oil from T2 plants expressing LCATl through 4 under the control of the 
napin promoter was extracted using an accelerated solvent extractor (ASE) method. Seed 
3 0 samples were ground, using a mortar and pestle, to achieve a fine homogeneous meal. Oil 
was obtained using a Dionex Accelerated Solvent Extractor (ASE). Clean ground seed 
was added to an equal amount of diatomaceous earth. The ground seed sample and the 
diatomaceous earth were thoroughly mixed until a homogeneous texture was achieved. 



wo 01/16308 



46 



PCT/USOO/23863 



The sample was then loaded into the instrument and oil extraction was achieved using 
hexane imder validated laboratory protocols. 

Oil ftom these seed samples was then analyzed for sterol ester analysis using 
HPLC/MS for free campesterol, stigmasterol, and sitosterol and their fetty acid esters. To 
5 the autosamplCT vial containing approximately 0.1 grams oil was added 0.3 mLs CDClj. 
One-hundred microliters of this solution was added to 90O microliters CHCl,. Five 
microliters of this diluted sample was subsequently injected into an HPLCVMS with 
positive ion atmospheric pressure ionization. The individual components in the oils were 
separated using two 4.6 x 50 mm Cj ZoAax columns in series and a gradient using 

1 0 acetonitrile and acetonitrile with 40% CHClj. The sterol concentrations were calculated 
assuming each sterol and its fatty acids have the same molar responses. This was observed 
to be the case with cholesterol and its esters and was assumed to be the case for 
campesterol, stigmasterol, and sitosterol. In the present study, the sterol identified as 
stigmasterol was actually an isomer of this compound. 

15 The results of these analyses are displayed in Figures 3 and 4 and shovr, that there 

were sterol ester enhancements on the order of 50%. in the seeds derived firom six out of 
seven T2 plant lines expressing LCAT3 (pCGN9968) under the control of the napin 
promoter. 

2 0 Example 8: Baculovirus Insect Cell Culture for Sterol Esterification Activity 

Baculovirus expression constract pCGN9992, pCGN9993, pCGN9994 and 
pCGN10900 (see Example 4) were transformed and expressed using &e BAC-TOBAC 
Baculovirus Expression System (Gibco-BRL, Gaithersburg, MD) according to the 
manufacturer's instructions except harvesting of recombinant viruses was done 5 days 

2 5 post-transfection. The supernatant from the transfection mixture was used for generating 

virus stock which in turn was used for infecting S& cells used in the assay. 

The transformed cells were assayed for lecithimsterol acyltransferase activities 
using the method described herein. Insect cells were centrifuged and the resulting cell 
pellet was either used immediately or stored at -70 C for later analysis. CeUs were 

3 0 resuspended in Medium A (1 00 mM Tricine/NaOH, pH 7.8, 10% (w/v) glycerol, 280 mM 

NaCl with : 0.1 jiM Aprotinin, 1 \iM Leupeptin, and 100 |jM Pefebloc (all from 
Boehringer Mannheim, Gemiany) and lysed by sonication (2 x 10 sec). Cell walls and 
other debris were pelleted by centrifugation (14,000 x g , 10 miii, 4"'C). The supernatant 
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was transferred to a new vial and membranes pelleted by centrifugation (100,000 x g, Ti 
70.1 rotor, 46,000 rpm for 1 hour at 4^C). Total membranes were resuspended in Medium 
A. Lecithin:sterol acyltransferase activity was assayed in a 0.1 ml reaction mixture 
containing 100 mM Tris/HCl. pH 7, 28 mM NaCl, 0.03% Triton X-100, 0,1 mM sitosterol, 
5 • 20 jiM 1 ,2-[*^Cl-palmitoyl-phosphatidyl choline (246420 dpm/nmole), and 0.05-20 mg of 
membrane protein. After 15 minutes at 30 "^C, the reaction was terminated by addition of a 
0.5 ml solution of methylene chloride rmethanol (4:1, v/v ) containing 100 \ig cholesterol 
and cholesterol ester as cold carriers. A portion (0.1 ml) of the bottom organic layer was 
removed and evaporated under nitrogen gas. The concentrated extract was resuspended in 

10 30 ^1 of hexane and spotted onto a silica gel-G thin layer chromatographic plate. The 
plate was migrated in hexane:diethyl ethenacetic acid (80:20:1) to the top, then air dried. 
Radioactivity was determined by exposure to a Low Energy Phosphor-imaging Screen. 
Following exposure, the screen was read on a phosphorimager. 

The LCAT 4 protein from pCGN10900 in baculovirus membranes showed a 

1 5 radioactive spot in the region of the TLC plate where cholesterol ester migrates indicating 
that LCAT 4 has the ability to catalyze the transfer of an acyl group from lecithin (PC) to 
sitosterol to make a sitosterol ester. 

Example 9: Plant Assay for Modified Lipid Content 

2 0 Nir (near infrared spectroscopy spectral scanning) can be used to determine the 

total oil content of Arabidopsis seeds in a non-destmctive way provided that a spectral 
calibration curve has been developed and validated for seed oil content. A seed oil 
spectral calibartion curve was developed using seed samples from 85 Arabidopsis plants* 
Seed was cleaned and scanned using a Foss NIR model 6500 CFoss-Nirs Systems, Inc.). 

2 5 Approximately 50 to 1 00 milligrams of whole seeds, per sample, were packed in a mini 
sample ring cup with quartz lens [ IH-0307 ] consisting a mini-insert [ IH-0337 ] and 
scanned in reflectance mode to obtain the spectral data. The seed samples were then 
ground, using a mortar and pestle, to achieve a fine homogeneous meal. The ground 
samples were measured for oil using an accelerated solvent extractor (ASE), 

30 Measurement for the total oil content was performed on the Dionex Accelerated 

Solvent Extractor (ASE). Approximately 500 mg of clean ground seed was weighed to the 
nearest 0.1 mg onto a 9 x 9 cm weigh boat. An equal amount of diatomaceous earth was 
added using a top-loading balance accurate to the nearest 0.01 g* The ground seed sample 
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and the diatomaceous earth were thoroughly mixed until a homogeneous texture was 
achieved. The sample was loaded on to the instrument and oil extraction was achieved 
using hexane under validated laboratory protocols. Standard Rapeseed samples were 
obtained from the Community Bureau of Reference (BCR). The ASE extraction method 
5 was validated using the BCR reference standards. A total percent oil recovery of 99% to 
100% was achieved. **As-is" oil content was calculated to the nearest 0.01 mass 
percentage using the formula: 

Oil Content = 100% x (vial plus extracted oil wt - initial vial wt) / (sample wt) 

10 

The analytical data generated by ASE were used to perform spectral calibrations. 
Nir calibration equations were generated using the built-in statistical package within the 
NirSyteras winisi software. The spectral calibration portion of the software is capable of 
calibration and self-validation. From a total of 85 samples, 57 samples were used to 

1 5 generate the total percent oil calibration. The remaining smiples were used to validate the 
oil calibrations. Optimized smoothing, derivative size, and mathematical treatment 
(modified partial least square) was utilized to generate the calibration. The samples that 
were not used in building respective calibrations were used as a validation set. Statistical 
tools such as correlation coefBcient ( R ), coefficient of determination (R^), standard enror 

2 0 of prediction ( SEP ), and the standard error of prediction corrected for bias (SEPC) were 
used to evaluate the calibration equations. 

T2 seeds from plants that had been transformed with the LCAT genes were cleaned 
and scanned using a Foss NIR model 6500 (Foss-Nirs Systems, Inc.). Approximately 50 to 
100 milligrams of whole seeds, per sample, were packed in a mini sample ring cup with 

2 5 quartz lens [ IH-0307 ] consisting a mini-msert [ IH-0337 ] and scanned in reflectance 

mode to obtain the spectral data. Oil percentage m each seed sample was detennined 
using the seed oil spectral calibration curve detailed above. 

The results of these analyses are found in Figure 5 and Table 2 and show that there 
was a significant increase in the oil level in seed from T2 plants expressing the LCAT2 

3 0 gene. This increase in oil was seen in plants when LCAT2 was driven by either the 35S 

constitutive promoter or the seed-specific napin promoter. These results show that 
overexpression of a nucleic acid sequence encoding a lecithmxholesterol acyltransferase- 
like polypeptide can increase seed oil production in plants. 
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Table 2 





Construct number 


Seed Oil Percentage (%) 


CONTROL 




24.7 


CONTROL 




28.0 


CONTROL 




31.8 


CONTROL 




32.4 


NAPIN LCATl 


PCGN9962 


28.5 


NAPINLCATl 


PCGN9962 


28.9 


NAPIN LCATl 


PCGN9962 


29.6 


NAPIN LCATl 


PCGN9962 


30.1 


NAPINLCATl 


PCGN9962 


30.1 


NAPINLCATl 


PCGN9962 


30.1 


NAPINLCATl 


PCGN9962 


30.8 


NAPINLCATl 


PCGN9962 


31.0 


NAPINLCATl 


pCGN9962 


32.1 


NAPINLCATl 


pCGN9962 


34.2 


NAPIN LCAT3 


pCGN9968 


26.8 


NAPIN LCAT3 


pCGN9968 


27.4 


NAPIN LCAT3 


pCGN9968 


29.0 


NAPIN LCAT3 


pCGN9968 


29.0 


NAPIN LCAT3 


pCGN9968 


32.6 


NAPIN LCAT2 


pCGN9983 


26.5 


NAPIN LCAT2 


pCGN9983 


34.7 


NAPIN LCAT2 


pCGN9983 


34.8 


NAPIN LCAT2 


pCGN9983 


35.7 


NAPIN LCAT2 


pCGN9983 


35.8 


NAPIN LCAT2 


pCGN9983 


36.3 


NAPIN LCAT2 


pCGN9983 


36.7 


NAPIN LCAT2 


pCGN9983 


37.0 


NAPIN LCAT2 


pCGN9983 


37.2 


NAPIN LCAT2 


pCGN9983 


37.3 


NAPIN LCAT2 


pCGN9983 


37.3 


NAPIN LCAT2 


pCGN9983 


37.4 


NAPIN LCAT2 


pCGN9983 


37.8 


NAPIN LCAT2 


pCGN9983 


38.0 


NAPIN LCAT2 


• pCGN9983 


38.0 


35S LCAT2 


pCGN9981- 


27.3 


35S LCAT2 


pCGN9981 


28.1 


35S LCAT2 


pCGN9981 


28.2 


35S LCAT2 


pCGN9981 


28.6 


35S LCAT2 


pCGN9981 


29.8 


35S LCAT2 


pCGN9981 


30.3 


35S LCAT2 


pCGN9981 


32.4 


35S LCAT2 


pCGN9981 


32.5 


35S LCAT2 


• pCGN9981 


33.6 


35S LCAT2 


pCGN9981 


34.1 


35S LCAT2 


pCGN9981 


35.5 


35S LCAT2 


pCGN9981 


36.4 


35S LCAT2 


pCGN9981 


37.1 


35S LCAT2 


PCGN9981 ' 


38.3 
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35S LCAT2 


pCGN9981 


38.5 


35S LCAT2 


PCGN9981 


39.1 



In light of the detailed description of the invention and the examples presented 
above, it can be appreciated that the several aspects of the invention are achieved. 
5 It is to be understood that the present invention has been described in detail by way 

of illustration and example in order to acquaint others skilled in the art with the invention, 
its principles, and its practical application. Particular formulations and processes of the 
present invention are not limited to the descriptions of the specific embodiments, 
presented, but rather the descriptions and examples should be viewed in trans of the 

1 a claims that follow and their equivalents. While some of the examples and descriptions 
above include some conclusions about the way the invention may function, the inventors 
do not intend to be bound by those conclusions and functions, but put them forth only as 
possible explanations. 

It is to be further understood that the specific embodiments of the present mvention 

15 as set forth are not intended as being exhaustive or limiting of the invention, and that many 
alternatives, modifications, and variations will be apparent to those of ordinary skill in the 
art in light of the foregoing examples and detailed description. Accordingly, this invention 
is intended to embrace all such alternatives, modifications, and variations that fall within 
the spirit and scope of the following claims. 
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What is claimed is: 

1 . An isolated nucleic acid sequence comprising a polynucleotide encoding a plant 
lecithinxholesterol acyltransferase-like polypeptide or fragment thereof. 

2. The isolated nucleic acid sequence of claim 1, wherein said plant lecithinxholesterol 
acyltransferase-like polypeptide is selected from the group consisting ofArabidopsis, 
soybean and com. 

3. An isolated nucleic acid sequence comprising a polynucleotide encoding a plant acyl 
CoAxholesterol acyltransferase-like polypeptide. 

4. The isolated nucleic acid sequence of claim 3, wherein said polynucleotide is SEQ ID 
NO: 42 or degeneiate variants thereof. 

5. The isolated nucleic acid sequence of claim 1, wherein said polynucleotide is selected 
from the group consisting of SEQ ID NO: 2, 4. 6, 8, 10, 11, 12, 13, 14, 15, 16, 17, 18, 
19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 43, 44, 45. 46, 47, 48, 49, 50, 51, 73 and 75 
or degenerate variants thereof. 

6. An isolated nucleic acid sequence consisting essentially of SEQ ID NO: 2, 4, 6, 8, 10, 
1 1, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 42. 43, 44, 45, 
46, 47, 48, 49. 50, 51, 73 or 75. 

7. An isolated nucleic acid sequaice consisting of SEQ ID NO: 2, 4, 6, 8, 10, 1 1, 12, 13, 
14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25. 26. 27, 28. 29, 42, 43. 44, 45, 46, 47, 48, 
49, 50, 51, 73 or 75. 

8. An isolated nucleic acid sequence conq>rising a polynucleotide selected from the 
group consistmg of: 

a) an isolated polynucleotide encoding a polypeptide of SEQ ID NO 3 or SEQ ID 
NO 3 with at least one conservative amino acid substitution; 
5 b) SEQ ID NO: 2; 

c) an isolated polynucleotide that has at least 70% sequence identity to SEQ ID 

NO: 2; 
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d) an isolated polynucleotide that has at least 80% sequence identity to SEQ ID 
NO: 2; 

e) an isolated polynucleotide that has at least 90% sequence identity to SEQ ID 
NO: 2; 

f) an isolated polynucleotide that has at least 95% sequence identity to SEQ ID 
NO: 2; 

g) an isolated polynucleotide of at least 10 nucleic acids that hybridizes under 
stringent conditions to SEQ ID NO: 2; 

h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d),(e),(f)or(g);and 

i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 2 and encodes a plant lecithin:cho}esterol acyltransferase-like 
polypeptide. 

9. An isolated nucleic acid sequence consisting essentially of a polynucleotide of the 
formula 5* X-(Ri)^-(R2)„-(R3)„-Y 3', where X is hydrogen, Y is hydrogen or rmetal, 
R, and are any nucleic acid, n is an integer between 0-3000, and Rj is selected from 
the group consisting of: 

a) an isolated polynucleotide encoding a polypeptide of SEQ ID NO: 3 or SEQ 
ID NO: 3 with at least one conservative amino acid substitution; 

b) SEQ ID NO: 2; • 

c) an isolated polynucleotide that has at least 70% sequence identity to SEQ ID 
NO: 2; 

d) an isolated polynucleotide that has at least 80% sequence identity to SEQ ID 
NO: 2; 

e) an isolated polynucleotide that has at least 90% sequence identity to SEQ ID 
NO: 2; 

f) an isolated polynucleotide that has at least 95% sequence identity to SEQ ID 
NO: 2; 

g) an isolated polynucleotide of at least 1 0 nucleic acids that hybridizes under 
stringent conditions to SEQ ED NO: 2; 

h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d),(eX(f)or(g);and 
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i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 2 and encodes a plant lecithinrcholesterol acyltransferase-like 
polypeptide. 



10. An isolated nucleic acid sequence comprising a polynucleotide selected from the 
group consisting of: 

a) an isolated polynucleotide encoding a polypeptide of SEQ ID NO: 5 or SEQ 
ID NO: 5 with at least one conservative amino acid substitution; 
5 b) SEQ ID NO: 4; 

c) an isolated polynucleotide having at least 70% sequence identity with SEQ ID 
NO: 4; 

d) an isolated polynucleotide having at least 80% sequence identity with SEQ ID 
NO: 4; 

10 e) an isolated polynucleotide having at least 90% sequence identity with SEQ ED 

NO: 4; 

f) an isolated polynucleotide having at least 95% sequence identity with SEQ ID 
NO: 4; 

g) an isolated polynucleotide of at least 1 0 nucleic acids that hybridizes under 
15 stringent conditions to SEQ ED NO: 4; 

h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d),(e),(f)or(g);and 

i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 4 and encodes a plant lecithin:cholesterol acyltransferase-like 

20 polypeptide. 



11. An isolated nucleic acid sequence consistmg essentially of a polynucleotide of the 
formula 5' X-(R,)a-(R2X-(R3)„-Y 3', where X is hydrogen, Y is hydrogen or a metal, 
R, and Rj are any nucleic acid, n is an integer between 0-3000, and is selected from 
the group consisting of: 

a) a] 1 isolated polynucleotide encoding a polypeptide of SEQ ED NO: 5 or SEQ 
JD NO: 5 with at least one conservative amino acid substitution; 

b) SEQ ED NO: 4; 
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c) an isolated polynucleotide having at least 70% sequence identity with SEQ ID 
NO: 4; 

d) an isolated polynucleotide having at least 80% sequence identity with SEQ ID 
NO: 4; 

e) an isolated polynucleotide having at least 90% sequence identity with SEQ ID 
NO: 4; ' 

f) an isolated polynucleotide having at least 95% sequence identity with SEQ ID 
NO: 4; 

g) an isolated polynucleotide of at least 10 nucleic acids that hybridizes under 
stringent conditions to SEQ ID NO: 4; 

h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d),(e),(f)or(g);and 

i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 4 and encodes a plant lecithintcholesterol acyltransferase-like 
polypeptide. 

12. An isolated nucleic acid sequence comprising a polynucleotide selected from the 
group consisting of: 

a) an isolated polynucleotide encoding a polypeptide of SEQ ED NO: 7 or SEQ 
ID NO: 7 with at least one conservative amino acid substitution; 

b) SEQ ID NO: 6; 

c) an isolated polynucleotide having at least 70% sequence identity with SEQ ID 
NO: 6; 

d) an isolated polynucleotide having at least 80% sequence identity with SEQ ID 
NO: 6; 

e) an isolated polynucleotide having at least 90% sequence identity with SEQ ID 
NO: 6; 

f) an isolated polynucleotide having at least 95% sequence identity with SEQ ID 
NO: 6; 

g) an isolated polynucleotide of at least 10 nucleic acids that hybridizes under 
stringent conditions to SEQ ID NO: 6; 

h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d),(e),(f)or(g>;and 
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i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 6 and encodes a plant Iecithin:cholesterol acyltransferase-like 
polypeptide. 

13, An isolated nucleic acid sequence consisting essentially of a polynucleotide of the 
formula 5' X-(R,)„-(R2)„-(R3)n-Y 3\ where X is hydrogen, Y is hydrogen or a metal, 
Ri and Rj are any nucleic acid, n is an integer between 0-3000, and is selected from 
the group consisting of: 

a) an isolated polynucleotide encoding a polypeptide of SEQ ID NO: 7 or SEQ 
ID NO: 7 with at least one conservative amino acid substitution; 

b) SEQ ID NO: 6; 

c) an isolated polynucleotide having at least 70% sequence identity with SEQ ID 
NO: 6; 

d) an isolated polynucleotide having at least 80% sequence identity with SEQ ID 
N0:6; 

e) an isolated polynucleotide having at least 90% sequence identity with SEQ ID 

NO: 6; 

f) an isolated polynucleotide having at least 95% sequence identity with SEQ ID 
NO: 6; 

g) an isolated polynucleotide of at least 1 0 nucleic acids that hybridizes under 
stringent conditions to SEQ ID NO: 6; 

h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d),(e),®or(g);and 

i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 6 and encodes a plant lecithin:cholesterol acyltransferase-like 
polypeptide. 

14. An isolated nucleic acid sequence comprising a polynucleotide selected jfrom the 
group consisting of 

a) an isolated polynucleotide encoding a polypeptide of SEQ ID NO: 9 or SEQ ID 
NO: 9 with at least one conservative amino acid substitution; 

b) SEQ ID NO 8; 
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c) an isolated polynucleotide having at least 70% sequence idaitity with SEQ ID 
NO: 8; 

d) an isolated polynucleotide having at least 80% sequence identity with SEQ ID 
NO: 8; 

e) an isolated polynucleotide having at least 90% sequence identity with SEQ ID 
NO: 8; 

f) an isolated polynucleotide having at least 95% sequence identity with SEQ ID 
NO: 8; 

g) an isolated polynucleotide of at least 10 nucleic acids that hybridizes mdst 
stringent conditions to SEQ ID NO: 8; 

h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), (d), 
(e),(Oor(g);and 

i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ ID 
NO: 8 and encodes a plant lecithin:cholesterol acyltransferase-like polypeptide. 

15. An isolated nucleic acid sequence consisting essentially of a polynucleotide of the 
formula 5' X-(R,V(R2)„-(R,)n-Y 3', where X is hydrogen, Y is hydrogen or a metal, 
R, and R, are any nucleic acid, n is an integer between 0-3000, and Rj is selected from 
the group consisting of: 



a) 


an isolated polynucleotide encoding a polypeptide of SEQ ID NO: 9 or SEQ 




ID NO: 9 with at least one conservative amino acid substitution; 


b) 


SEQ ED NO: 8; 


c) 


an isolated polynucleotide having at least 70% sequence identity with SEQ ED 




NO: 8; 


d) 


an isolated polynucleotide having at least 80% sequence identity with SEQ ID 




NO: 8; 


e) 


an isolated polynucleotide having at least 90% sequence identity with SEQ ID 




NO: 8; 


f) 


an isolated polynucleotide having at least 95% sequence identity with SEQ ID 




NO: 8; 


g) 


an isolated polynucleotide of at least 10 nucleic acids that hybridizes undar 



stringent conditions to SEQ ID NO: 8; 
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h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d),(eX(f)or(g);and 

i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 8 and encodes a plant lecithlnxholesterol acyltransferase-like 
polypeptide. 

>. An isolated nucleic acid sequence comprising a polynucleotide selected from the 
group consisting of: 

a) an isolated polynucleotide encoding a polypeptide of SEQ ID NO: 74 or SEQ 
ID NO: 74 with at least one conservative amino acid substitution; 

b) SEQ ID NO: 73; 

c) an isolated polynucleotide having at least 70% sequence identity with SEQ ID 
NO: 73; 

d) an isolated polynucleotide having at least 80% sequence identity with SEQ ID 
NO: 73; 

e) an isolated polynucleotide having at least 90% sequence identity with SEQ ID 
NO: 73; 

f) an isolated polynucleotide having at least 95% sequence identity with SEQ ID 
NO: 73; 

g) . an isolated polynucleotide of at least 10 nucleic acids that hybridizes under 

stringent conditions to SEQ ID NO: 73; 

h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(dX(e).(f)or(g);and 

i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 73 and encodes a plant lecithin:cholesterol acyltransferase-like 
polypeptide. 

h An isolated nucleic acid sequence consisting essentially of a polynucleotide of the 
formula 5' X-(Rj)„-(R2)n-(R3)ii-Y 3', where X is hydrogen, Y is hydrogen or a metal, 
Ri and are any nucleic acid, n is an integer between 0-3000, and is selected from 
the group consisting of: 

a) an isolated polynucleotide encoding a polypeptide of SEQ ID NO: 74 or SEQ 
ID NO: 74 with at least one conservative amino acid substitution; 



WO0iyj6308 PCT/US00/238(>3 

58 

b) SEQIDNO:73; 

c) an isolated polynucleotide having at least 70% sequence identity with SEQE) 
NO: 7S; 

10 d) an isolated polynucleotide having at least 80% sequence identity with SEQ ID 

NO: 73; 

e) an isolated polynucleotide having at least 90% sequence identity with SEQ ID 
NO: 73; 

f) an isolated polynucleotide having at least 95% sequence identity with SEQ ID 
15 NO: 73; 

g) an isolated polynucleotide of at least 1 0 nucleic acids that hybridizes under 
stringent conditions to SEQ ID NO: 73; 

) h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 

(d),(e).(f)or(g);and 

20 i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 

ID NO: 73 and encodes a plant lecithin:cholesterol acyltransferase-Uke 
polypeptide. 

18, A isolated nucleic acid sequence comprising a polynucleotide selected from the group 

consisting of: . 

a) an isolated polynucleotide encoding a polypeptide of SEQ ID NO: 76 or SEQ 
ID NO: 76 with at least one conservative amino acid substitution; 
5 b) SEQ ID NO: 75; 

. ) c) an isolated polynucleotide having at least 70% sequence identity with- SEQ ID 

NO: 75; 

d) an isolated polynucleotide having at least 80% sequence identity with SEQ ID 
NO: 75; 

10 e) an isolated polynucleotide having at least 90% sequence identity with SEQ ID 

NO: 75; 

f) an isolated polynucleotide having at least 95% sequence identity with SEQ ID 
NO: 75; 

g) an isolated polynucleotide of at least 10 nucleic acids that hybridizes under 
1 5 stringent conditions to SEQ ID NO: 75 ; 
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h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d), (e), (f)or(g);and 

i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 75 and encodes a plant lecithinxholesterol acyltransferase-like 
polypeptide. 

19, An isolated nucleic acid sequence consisting essentially of a polynucleotide of the 
formula 5' X-(Rt)„-(R2)n-(R3)n-Y 3% where X is hydrogen, Y is hydrogen or a metal, 
Ri and R3 are any nucleic acid, n is an integer between 0-3000, and Rj is selected from 
the group consisting of: 

a) an isolated polynucleotide encoding a polypeptide of SEQ ID NO: 76 or SEQ 
ID NO: 76 with at least one conservative amino acid substitution; 

b) SEQ ID NO: 75; 

c) an isolated polynucleotide having at least 70% sequence identity with SEQ ID 
NO: 75; 

d) an isolated polynucleotide having at least 80% sequence identity with SEQ ID 
NO: 75; 

e) an isolated polynucleotide having at least 90% sequence identity with SEQ ID 

NO: 75; 

f) an isolated polynucleotide having at least 95% sequence identity with SEQ ID 
NO: 75; 

g) an isolated polynucleotide of at least 10 nucleic acids that hybridizes under 
stringent conditions to SEQ ID NO: 75; 

h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d),(e),(f)or(g);and 

i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 75 and encodes a plant lecithin:cholesterol acyltransferase-like 
polypeptide. 



20, 



An isolated nucleic acid sequence comprising a polynucleotide selected from the 
group consisting of: 

a) SEQ ID NO: 42 or a degenerate variant thereof; 
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an isolated polynucleotide having at least 70% sequence identity with SEQ ID 
NO: 42; 

an isolated polynucleotide having at least 80% sequence identity with SEQ ID 

NO: 42; 

an isolated polynucleotide having at least 90% sequence identity with SEQ ID 
NO: 42; 

an isolated polynucleotide having at least 95% sequence identity with SEQ ID 
NO: 42; 

an isolated polynucleotide of at least 10 nucleic acids that hybridizes under 
stringent conditions to SEQ ID NO: 42; ^ 
an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d),(e),or(f);and 

an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 42 and encodes an acyl CoAicholesterol acyltransferase-like 
polypeptide. 

21. An isolated nucleic acid sequence consisting essentially of a polynucleotide of the 

formula 5' X'(RX'(^i)A'^)rr^ where X is hydrogen, Y is hydrogen or a metal R, 
and R3 are any nucleic acid, n is an integer between 0 and 3000, and is selected 
from the group consisting of: 

a) SEQ ID NO: 42 or degenerate variants thereof; 

b) an isolated polynucleotide having at least 70% sequence identity to SEQ ID 
NO: 42; 

c) an isolated polynucleotide having at least 80% sequence identity to SEQ ID 
NO: 42;. 

d) an isolated polynucleotide having at least 90% sequence identity to SEQ ID 
NO: 42; 

e) an isolated polynucleotide having at least 95% sequence identity to SEQ ID 
NO: 42; 

f) an isolated polynucleotide of at least 1 0 nucleic acids that hybridizes, under 
stringent conditions to SEQ ID NO: 42; 

g) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d),(e),or(£);and 



WO 01/163(W 
b) 

c) 

d) 

e) 

f) 

g) 

h) 
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h) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 42 and encodes an acyl CoArcholesterol acyltransferase-like 
20 polypeptide, 

22. A recombinant nucleic acid construct comprising a regulatory sequence operably 
linked to polynucleotide encoding a lecithinxholesterol acyltransferase-like 
polypeptide or a fragment thereof. 

23. The recombinant nucleic acid construct of claim 22, wherein said lecithinxholesterol 
acyltransferase-like polypeptide is a plant lecithinxholesterol acyltransferase-like 
polypeptide. 



24. A recombinant nucleic acid construct comprising a regulatory sequence operably 
linked to a polynucleotide encoding an acyl CoAxhblesterol acyltransferase-like 
polypeptide, 

25. The recombinant nucleic acid construct of claim 24, wherein said acyl 
CoAxholesterol acyltransferase-like polypeptide is a plant acyl CoAxholesterol 
acyltransferase-like polypeptide. 

26. The recombinant construct of claim 22, wherein said regulatory sequence comprises a 
heterologous regulatory sequence- 

27. The recombinant construct of claim 24, wherein said regulatory sequence comprises a 
heterologous regulatory sequence. 

28. The recombinant construct of claim 22, wherein said regulatory sequence is functional 
in a plant cell. 

29. The recombinant construct of claim 24, wherein said regulatory sequence is functional 
in a plant cell. 



30. 



The recombinant construct of claim 22, further comprising a tCTninatioh sequence. 
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3 1 . The recombinant construct of claim 24 further comprising a termination sequence. 

32. The recombinant construct of claim 22 wherein said polynucleotide is selected from 
the group consistingof SEQ IDNO: 2,4, 6, 8, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 
20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 43, 44, 45, 46, 47, 48, 49, 50, 51, 73 and 75. 

33. The recombinant construct of claim 24, wherein said polynucleotide is selected from 
the group consisting of SEQ ID NO: 33 and 42. 

34. The recombinant construct of claim 22, wherein said regulatory sequence comprises a 
) constitutive promoter. 

3 5. The recombinant construct of claim 24, wherein said regulatory sequence comprises a 
constitutive promoter. 

36. The recombinant construct of claim 22, wherein said regulatory sequence comprises 
an inducible promoter. 

37. The recombinant construct of claim 24, wherein said regulatory sequence comprises 
an inducible promoter. 

) 38. The recombinant construct of claim 22, wherein: said regulatory sequence is selected 
from the group consisting of a tissue specific promoter, a developmentally regulated 
promoter, an organelle specific promoter, and a seed specific promoter. 

39. The recombinant construct of claim 24, wherein said regulatory sequence is selected 
from the group consisting of a tissue specific promoter, a developmentally regulated 
promoter, an organelle specific promoter, and a seed specific promoter. 



40. 



A host cell containing the recombinant construct of claim 22 or 24. 
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41 . The host cell of claim 40, wherein said host cell is selected from the group consisting 
of plant cells, animal cells, insect cells, yeast, bacteria, bacteriophage and vuiises. 



42. The host cell of claim 40, wherein said host cell is a plant cell. 

43. The host cell of claim 40, wherein said host cell expresses a lecithin:cholesterol 
acyltransferase-like polypeptide or an acyl CoA.cholesterol acyltransferase-like 
polypeptide, 

44. The host cell of claim 43, wherein said cholesterol acyltransferase-like polypeptide is 
a plant acyltransferase-like polypeptide. 

45. A plant comprising at least one host cell of claun 40. 

46. The progeny of a plant of claim 45. 

47. A seed from the plant of claim 45. 

48. A plant comprising the recombinant construct of claim 22 or 24. 

49. The progeny of a plant of claim 48. 

50. A seed from the plant of claim 48. 

51. A purified polypeptide comprising an amino acid sequence selected from the group 
consisting of SEQ ID NO: 3, SEQ ID NO: 3 with at least one conservative amino acid 

' substitution, SEQ ID NO: 5, SEQ ID NO: 5 with at least one conservative amino acid 
substitution, SEQ ID NO: 7, SEQ ID NO: 7 with at least one conservative amino acid 
substitution, SEQ ID NO: 9, SEQ ID NO: 9 with at least one conservative amino acid 
substitution, SEQ ID NO: 74, SEQ ID NO: 74 with at least one conservative amino 
acid substitution, SEQ ID NO: 76 and SEQ ID NO: 76 widx at least one conservative 
amino acid substitution. 
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52. A purified immunogenic polypq)tide comprising at least 1 0 consecutive amino acids 
from an amino acid sequence selected from the group consisting of SEQ ID NO: 3, 
SEQ ro NO: 3 with at least one conservative amino acid substitution, SEQ ID NO: 5, 
SEQ ID NO: 5 with at least one conservative amino acid substitution, SEQ ID NO: 7, 
SEQ ID NO: 7 with at least one conservative amino acid substitution, SEQ ID NO: 9, 
SEQ ID NO: 9 with at least one conservative amino acid substitution, SEQ ID NO: 
74, SEQ ID NO: 74 with at least one conservative amino acid substitution, SEQ ID 
NO: 76 and SEQ ID NO: 76 with at least one conservative amino acid substitution. 

53. An antibody which specifically binds to an immunogenic polypeptide of claim 52. 

54. A method for producing a lecithinrcholesterol acyltransferase-like polypeptide or an 
acyl CoA:choIesteroI acyltransferase-like polypeptide comprising culturing a host cell 
of claim 40 under conditions permittuig expression of said lecithin:cholesterol 
acyltransferase-like polypeptide or acyl CoA:cholesterol acyltransferase-like 
polypeptide. 

55. The method of claim 54, further comprising isolatmg the cholesterol acyltransferase- 
like polypeptide from the host cell or from the medium in which the host cell is 
cultxu'ed. 

56. A method for modifying the sterol content of a host cell, comprising transforming a 
host cell with a recombinant construct containing a regulatory sequence operably . 
linked to a polynucleotide encodmg a lecithin:cholesterol acyltransferase-like 
polypeptide and culturing said host cell under conditions wherein said host cell 
expresses a lecithinrcholesterol acyltransferase-like polypeptide such that said host 
cell has a modified sterol composition as compared to host cells without the 
recombinant construct 



The method of claim 56, wherein said lecithin:cholesterol acyltransferase-like 
polypeptide is a plant lecithin:cholesterol acyltransferase-like polypeptide. 




WO01/J6308 PCT/USOO/23863 

65 

58, A method for modifying the sterol content of a host cell, comprising transforming a 
host cell with a recombinant construct containing a regulatory sequence operably 
linked to a polynucleotide encoding an acyl CoAxholesterol acyltransferase-like 
polypeptide and culturing said host cell under conditions wherein said host cell 
5 expresses an acyl CoA:cholesterol acyltransferase-like polypeptide such that said host 

cell has a modified sterol composition as compared to host cells without the 
recombinant construct. 



59. The method of claim 58, wherein said acyl CoAxholesterol acyltransferase-like 
polypeptide is a plant acyl CoAxholesterol acyltransferase-like polypeptide. 

60. The method of claim 56, wherein said modified sterol composition is an increase in 
sterol esters, 

61 . The method of claim 58, wherein said modified sterol composition is an increase in 
sterol esters. 



62. The method of claim 56, wherein said polynucleotide encoding a lecithinxholesterol 
acyltransferase-like polypeptide is selected from the group consisting of SEQ E> NO: 
2,4,6,8,10, 11, 73 and 75. 

63, The method of claim 58 wherein said polynucleotide encoding a acyl CoAxholesterol 
acyltransferase-like polypeptide is SEQ ID NO 33 OR 42. 



64. The method of claim 56, wherein said regulatory sequence comprises a constitutive 
promoter. 

65. The method of claim 58, wherein said regulatory sequence comprises a constitutive 
promoter. 

66. The method of claim 56, wherein said regulatory sequence is an inducible promoter. 



67. 



The method of claim 58, wherein said regulatory sequence is an inducible promoter. 
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68. The method of claim 56, wherein said regulatory sequence is a tissue specific 
promoter. 

69. The method of claim 58, wherein said regulatory sequence is a tissue specific 
promoter 

70. The method of claim 56, wherein said regulatory sequence is. a seed specific promoter. 

71 . The method of claim 58, wherein said regulatory sequence is a seed specific promoter, 

72. The method of claim 56, wherein said polynucleotide encoding a lecithinrcholesterol 
acyltransferase-like polypeptide is in the antisense orientation. 

73. The method of claim 58, wherein said polynucleotide encoding an acyl 
CoAxholesterol acyltransferase-like polypeptide is in the antisense orientation. 

74. The method of claim 72, wherein said modified sterol composition is a decrease in 
sterol esters. 

75. The method of claim 73, wherein said modified sterol composition is a decrease in 
sterol esters, 

76. A plant comprising a recombinant construct containing a regulatory sequence 
operably linked to a polynucleotide encoding a lecifliinxholesterol acyltransferase- 
like polypeptide wherein expression of said recombinant construct results in modified 
sterol composition of said plant as compared to the same plant without said 
recombinant construct. 

77. The plant of claim 76, wherein said leci thinrcholesterol acyltransferase-like 
polypeptide is a plant lecithin:cholesterol acyltransferase-like polypeptide. 
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78. The plant of claim 76, wherein said polynucleotide encoding a lecithinxholesterol 

acyltransferase-like polypeptide is selected from the group consisting of SEQ ID NO: 
2,4, 6, 8, 10, 11, 73 and 75. 



79. A plant comprising a recombinant construct containing a regulatory sequence 
operably linked to a polynucleotide encoding an acyl CoAicholesterol acyltransferase- 
like polypeptide wherein expression of said recombinant construct results in modified 
sterol composition of said plant as compared to the same plant without said 

5 recombinant construct 

80. ' The plant of claim 79, wherein said acyl CoA:cholesterol acyltransferase-like 
polypeptide is a plant acyl CoAxholesterol acyltransferase-like polypeptide. 

81 . The plant of claim 79, wherein said polynucleotide encoding an acyl CoAxholesterol 
acyltransferase-like polypeptide is SEQ ID NO: 33 or 42. 



82. The plant of claim 76, wherein said regulatory sequence comprises a tissue specific 
promoter. 

83. The plant of claim 79, wherein said regulatory sequence comprises a tissue specific 
promoter. 

84. The plant of claim 76, wherein said regulatory sequence comprises a seed specific 
promoter, 

85. The plant of claim 79, wherein said regulatory sequence comprises a seed specific 

*> 

promoter- 

86. The plant of claim 76, wherein said modified sterol composition is an increase in 
sterol esters. 



87. 



The plant of claim 79, wherein said modified sterol composition is an increase in 
sterol esters. 



wo 01/16308 PCTAJSOO/23863 

68 



88. The plant of claim 76, wherein the polynucleotide encoding a lecithin:cholesterol 
acyltransferase-like polypeptide is in the antisense orientation. 

89. The plant of claim 79, wherein the polynucleotide encoding an acyl toAxholesterol 
acyltransferase-like polypeptide is in the antisense orientation. 

.» 

90. An oil obtained from the plant of claim 76 or 79. 

91. A method for producing an oil with a modified sterol composition comprising, 
providing a plant of claim 76 or 79 and extracting the oil from said plant. 

92. An oil produced by the method of cldm 91. 

93. A method for altering oil production by a host cell comprising, transforming a host 
cell with a recombinant construct containing a regulatory sequence operably linked to 
a polynucleotide encoding a lecithinrcholesterol acyltransferase-like polypeptide and 
culturing said host cell under conditions wherein said host cell expresses a 
lecithinxholesterol acyltransferase-like polypeptide such that said host cell has an 
altered oil production as compared to host cells without the recombinant construct. 

94. The method of claim 93, wherein said lecithinxholesterol acyltransferase-like 
polypeptide is a plant lecithinxholesterol acyltransferase-like polypeptide. 

95 . A method for altering oil production by a host cell comprising, transforming a host 
cell with a recombinant construct containing a regulatory sequence operably linked to 
a polynucleotide encoding an acyl CoAxholesterol acyltransferase-like polypeptide 
and culturing said host cell under conditions wherein said host cell expresses an acyl 
CoAxholesterol acyltransferase-like polypeptide such that said host cell has an altered 
oil production as compared to host cells without the recombinant construct. 



96. 



The method of claim 95, wherein said acy} CoAxholesterol acylh-ansferase-Hke 
polypeptide is a plant acyl CoAxholesterol acyltransferase-like polypeptide. 
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.97. The method of claim 93, wherein said oil production is increased. 

98. The method of claim 95, wherem said oil production is increased. 

99. The method of claim 93, wherein said host cell is a plant cell. 

100. The method of claim 95, wherein said host cell is a plant cell. 

101 . The method of claim 93, wherein said polynucleotide encoding a lecithinxholesterol 
acyltransferase-like polypeptide is selected from the group consisting of SEQ ID NO: 
2, 4, 6, 8, 10, 11,73 and 75. 

102. The method of clairii 95, wherein said polynucleotide encoding an acyl 
CoA:cholesterol acyltransferase-like polypeptide is SEQ ID NO: 33 or 42. 

103. The method of claim 93, wherein said regulatory sequence is a tissue specific 



104. The method of claim 95, wherein said regulatory sequence is a tissue specific 
promoter. 

105. The method of claim 93, wherein ^id regulatory sequence is a seed specific promoter. 

106. The method of claim 95, wherein said regulatory sequence is a seed specific promoter. 

107* A plant comprising a recombinant construct containing a regulatory sequence 

operably linked to a polynucleotide encoding a lecitbinrcholesterol acyltransferase- 
like polypeptide wherein expression of said recombinant construct results in an altered 
production of oil by said plant as compared to the same plant without said"" 
5 recombinant construct, 



promoter. 
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The plant of claim 1 07, wherein said lecithinxholesterol acyltransferase-like 
polypeptide is a plant lecithinxholesterol acyltransferase-like polypeptide, 

1 09. A plant comprising a recombinant construct containing a regulatory sequence 
operably linked to a polynucleotide encoding an acyl CoAxholesterol acyltransferase- 
like polypeptide wherein expression of said recombinant construct results in an altered 
production of oil by said plant as compared to the same plant without said 
recombinant construct. 

1 10. The plant of claim 109, wherein said acyl CoAxholesterol acyltransferase-like 
polypeptide is a plant acyl CoAxholesterol acyltransferase-like polypeptide. 

111. The plant of claim 1 07, wherein said oil production is increased. 

1 12. The plant of claim 109, wherein said oil production is increased. 

113. The plant of claim 107, wherein said polynucleotide encoding a lecithinxholesterol 
acyltransferase-like polypeptide is selected from the group consisting of SEQ IDNO: 
2,4,6,8. 10, 11, 73 and 75. 

114. The plant of claim 109, wherein said polynucleotide encoding an acyl 
CoAxholesterol acyltransferase-like polypeptide is. SEQ ID NO: 33 or 42. 

115. The plant of claim 107, wherein said regulatory sequence is a tissue specific promoter. 

1 16. The plant of claim 109, wherein said regulatory sequence is a tissue specific promoter. 

1 1 7. The plant of claim 1 07, wherein said regulatory sequence is a seed specific promoter. 

118. The plant of claim 109, wherein said regulatory sequence is a seed specific promoter. 

119. A food product comprising the oil of claim 90 or 92. 
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120. A food product comprising the plant of claim 107 or 109. 
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SEQUENCE LISTING 

<110> Monsanto Company 

<120> PLANT STEROL ACYLTRANSFERASES 

<130> MTC6718 

5 <140> 
<141> 

<150> 60/152,493 
<151> 199a-08-30 

<160> 80 

10 <170> Patentin Ver. 2.1 

<210> 1 
<211> 440 
<212> PRT 

<213> Homo sapiens 
15 <400> 1 

Met Gly Pro Pro Gly Ser Pro Trp Qln Trp Val Thr Leu Leu Leu Gly 
15 10 15 

Leu Leu Leu Pro Pro Ala Ala Pro Phe Trp Leu Leu Asn Val Leu Phe 
20 25 30 

20 Pro Pro His Thr Thr Pro Lys Ala Glu Leu Ser Asn His Thr Arg Pro 
35 40 45 

Val lie Leu Val Pro Gly Cys Leu Gly Asn Gin Leu Glu Ala Lys Leu 
50 55 60 

Asp Lys Pro Asp Val Val Asn Trp Met Cys Tyr Arg Lys Thr Glu Asp 
25 65 70 75 80 



Phe Phe Thr lie Trp Leu Asp Leu 
85 

Asp Cys Trp He Asp Asn Thr Arg 
100 

30 Leu Val Ser Asn Ala Pro Gly Val 
115 120 

Lys Thr Tyr Ser Val Glu Tyr Leu 
130 135 

Leu His Thr Leu Val Gin Asn Leu 
35 145 150 



Asn Met Phe Leu Pro Leu Gly Val 
90 95 

Val Val Tyr Asn Arg Ser Ser Gly 
105 110 

Gin He Arg Val Pro Gly Phe Gly 
125 

Asp Ser Ser Lys Leu Ala Gly Tyr 
140 

Val Asn Asn Gly Tyr Val Arg Asp 
155 160 



Glu Thr Val Arg Ala Ala Pro Tyr Asp Trp Arg Leu Glu Pro Gly Gin 
165 170 175 
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Gin Glu Glu Tyr Tyr. Arg Lys Leu Ala Gly Leu Val Glu Qlu Met His 
180 185 190 

. Ala Ala Tyr Gly Lys Pro Val Phe Leu lie Gly His Ser Leu Gly Cys 
195 200 205 

5 Leu His Leu Leu Tyr Phe Leu Leu Arg Gin Pro Gin Ala Trp Lys Asp 
210 215 220 

Arg Phe lie Asp Gly Phe lie Ser Leu Gly Ala Pro Trp Gly Gly Ser 
225 230 235 240 

He Lys Pro Met Leu Val Leu Ala Ser Gly Asp Asn Gin Gly He Pro 
10 245 250 255 

He Met Ser Ser He Lys Leu Lys Glu Glu Gin Arg He Thr Thr Thr 
260 265 270 

Ser Pro Trp Met Phe Pro Ser Arg Met Ala Trp Pro Glu Asp His Val 
275 280 285 

15 Phe He Ser Thr Pro Ser Phe Asn Tyr Thr Gly Arg Asp Phe Gin Arg 
290 295 ' 300 

Phe Phe Ala Asp Leu His Phe Glu Glu Gly Trp Tyr Met Trp Leu Gin 
305 310 315 320 

Ser Arg Asp Leu Leu Ala Gly Leu Pro Ala Pro Gly Val Qlu Val Tyr 
20 325 330 335 

Cys Leu Tyr Gly val Gly Leu Pro Thr Pro Arg Thr Tyr He Tyr Asp 
340 345 350 

His Gly Phe Pro Tyr Thr Asp Pro Val Gly Val Leu Tyr Glu Asp Gly 
355 360 365 

25 Asp Asp Thr Val Ala Thr Arg Ser Thr Glu Leu Cys Gly Leu Trp Gin 
370 375 380 

Gly Arg Gin Pro Gin Pro Val His Leu Leu Pro Leu His Gly He Gin 
385 390 395 400 

His Leu Asn Met Val Phe Ser Asn Leu Thr Leu Glu His He Asn Ala 
30 405 410 415 

He Leu Leu Gly Ala Tyr Arg Gin Gly Pro Pro Ala Ser Pro Thr Ala 
420 425 430 



Ser Pro Glu Pro Pro Pro Pro Glu 
435 440 
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<210> 2 
<211> 1299 
<212> DNA 

<213> Arabidopsis thaliazia 

5 <400> 2 

atgaaaaaaa tatcttcaca ttattcggta 
acctcgatgt gtcaagctgt gggtagcaac 
ggaggtaacc agctagaggt acggctggac 
agcagctggt tatatccgat tcataagaag 

10 gcagcagtgt tattgtctcc cttcaccagg 
gaccctgatt tggatgatta ccaaaatgct 
ggttcgacca aatcacttct atacctcgac 
gaacatttgg tgaaagctct agagaaaaaa 
ggagctccat atgatttcag gtacggcctg 

15 tcacagttcc tacaagacct caaacaattg 
aagccagtga tactcctctc ccatagccta 
cgtaccaccc cttcatggcg ccgcaagtac 
tggggtggga cgatctctca gatgaagaca 
ttagttaacc ctttgctggt cagacggcat 

20 cttccatcta ccaaagtgtt tcacgacaga 
aactacacag cttacgagat ggatcggttt 
gtgccttaca agacaagagt gttgccttta 
gtcacttgca tatatgggag aggagttgat 
ggattcgata agcaaccaga gattaagtat 

25 agcttagcag ctttgaaagt cgatagcttg 
acatctatac ttaaagacga gatcgcactt 
aattatgaat tagccaatgt taatgccgtc 



gtcatagcga tactcgttgt ggtgacgatg 60 

gtgtaccctt tgattctggt tccaggaaac 120 

agagaataca agccaagtag tgtctggtgt 180 

agtggtggat ggtttaggct atggttcgat 240 

tgcttcagcg atcgaatgat gttgtactat 300 

cctggtgtcc aaacccgggt tcctcatttc 360 

cctcgtctcc gagatgccac atcttacatg 420 

tgcgggtatg ttaacgacca aaccatccta 4 80 

gctgcttcgg gccacccgtc ccgtgtagcc 540 

gtggaaaaaa ctagcagcga gaacgaagga 600 

ggaggacttt tcgtcctcca tttcctcaac 660 

atcaaacact ttgttgcact cgctgcgcca 720 

tttgcttctg gcaacacact cggtgtccct 780 

cagaggacct ccgagagtaa ccaatggcta 840 

actaaaccgc ttgtcgtaac tccccaggtt 900 

tttgcagaca ttggattctc acaaggagtt 960 

acagaggagc tgatgactcc gggagtgcca 1020 

acaccggagg ttttgatgta tggaaaagga 1080 

ggagatggag atgggacggt taatttggcg 1140 

aacaccgtag agattgatgg agtttcgcat 1200 

aaagagatta tgaagcagat ttcaattatt 1260 
aatgaatga 1299 



<210> 3 
<211> 432 
30 «:212> PRT 

<213> Arabidopsis thaliana 

<400> 3 

Met Lys Lys He Ser Ser His Tyr Ser Val Val lie Ala He Leu Val 
1 5 . 10 15 

35 Val Val Thr Met Thr Ser Met Cys Gin Ala Val Gly Ser Asn Val Tyr 
20 25 30 

Pro Leu He Leu Val Pro Gly Asn Gly Gly Asn Gin Leu Glu Val Arg 
35 40 45 

Leu Asp Arg Glu Tyr Lys Pro Ser Ser Val Trp Cys Ser Ser Trp Leu 
40 50 55 60 

Tyr Pro He His Lys Lys Ser Gly Gly Trp Phe Arg Leu Trp Phe Asp 
65 . 70 75 80 

Ala Ala Val Leu Leu Ser Pro Phe Thr Arg Cys Phe Ser Asp Arg Met 
85 90 95 



45 



Met Leu Tyr Tyr Asp Pro Asp Leu Asp Asp Tyr Gin. Asn Ala Pro Gly 
100 105 110 
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Val Gin Thr Arg Val Pro His Phe Gly Ser Thr Lys Ser Leu Leu Tyr 
115 X20 125 

Leu Asp Pro Arg Leu Arg Asp Ala Thr Ser Tyr Met Glu His Leu Val 
130 135 140 

5 Lys Ala Leu Glu Lys Lys Cys Gly Tyr Val Asn Asp Gin Thr He Leu 
145 150 155 160 

Gly Ala Pro Tyr Asp Phe Arg Tyr Gly Leu Ala Ala Ser Gly His Pro 
165 170 175 

Ser Arg Val Ala Ser Gin Phe Leu Gin Asp Leu Lys Gin Leu Val Glu 
10 180 185 X90 

Lys Thr Ser Ser Glu Asn Glu Gly Lys Pro Val He Leu Leu Ser His 
195 200 205 

Ser Leu Gly Gly Leu Phe Val Leu His Phe Leu Asn Arg Thr Thr Pro 
210 215 220 

15 Ser Trp Arg Arg Lys Tyr He Lys His Phe Val Ala Leu Ala Ala Pro 
225 230 235 240 

Trp Gly Gly Thr He Ser Gin Met Lys Thr Phe Ala Ser Gly Asn Thr 
245 250 255 

Leu Gly Val Pro Leu Val Asn Pro Leu Leu Val Arg Arg His Gin Arg 
20 260 265 270 

Thr Ser Glu Ser Asn Gin Trp Leu Leu Pro Ser Thr Lys Val Phe His 
275 280 285 

Asp Arg Thr Lys Pro Leu Val Val Thr Pro Gin Val Asn Tyr Thr Ala 
290 295 300 

25 Tyr Glu Met Asp Arg Phe Phe Ala Asp He Gly Phe Ser Gin Gly Val 
305 310 315 320 

Val Pro Tyr Lys Thr Arg Val Leu Pro Leu Thr Glu Glu Leu Met Thr 
325 330 335 



Pro Gly Val Pro Val Thr Cys He Tyr Gly Arg Gly Val Asp Thr Pro 
30 340 345 350 

Glu Val Leu Met Tyr Gly Lys Gly Gly Phe Asp Lys Gin Pro Glu He 
355 360 365 

Lys Tyr Gly Asp Gly Asp Gly Thr Val Asn Leu Ala Ser Leu Ala Ala 
370 375 380 

35 Leu Lys Val Asp Ser Leu Asn Thr Val Glu He Asp Gly Val Ser His 
385 390 395 400 

Thr Ser He Leu Lys Asp Glu He Ala Leu Lys Glu He Met Lys Gin 
405 410 415 



erwirv Mir* nA^atnah^t i ^ 
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lie Ser lie lie Asn Tyr Glu Leu Ala Asn Val Asn Ala Val Asn Glu 
420 425 430 



<210> 4 
5 <211> 1641 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 4 

atgggagcga attcgaaatc agtaacggct tccttcaccg tcatcgccgt ttttttcttg 60 

10 atttgcggtg gccgaactgc ggtggaggat gagaccgagt ttcacggcga ctactcgaag 120 
ctatcgggta taatcattcc gggatttgcg tcgacgcagc tacgagcgtg gtcgatcctt 180 
gactgtccat acactccgtt ggacttcaat ccgctcgacc tcgtatggct agacaccact 240 
aagcttcttt ctgctgtcaa ctgctggttt aagtgtatgg bgctagatcc ttataatcaa 300 
acagaccatc ccgagtgtaa gtcacggcct gacagtggtc tttcagccat cacagaattg 360 

15 gatccaggtt acataacagg tcctctttct actgtctgga aagagtggct taagtggtgt 420 
gttgagtttg gtatagaagc aaatgcaatt gtcgctgttc catacgattg gagattgtca 4 80 
ccaaccaaat tggaagagcg tgacctttac tttcacaagc tcaagttgac ctttgaaact 540 
gctttaaaac tccgtggcgg cccttctata gtatttgccc attcaatggg taataatgtc 600 
ttcagatact ttctggaatg gctgaggcta gaaattgcac caaaacatta tttgaagtgg 660 

20 cttgatcagc atatccatgc ttatttcgct gttggagctc ctcttcttgg ttctgttgag 720 
gcaatcaaat ctactctctc tggtgtaacg tttggccttc ctgtttctga gggaactgct 780 
cggttgttgt ccaattcttt tgcgtcgtca ttgtggctta tgccattttc aaagaattgc 840 
aagggtgata acacatcctg gacgcatttt tctgggggtg ctgcaaagaa agataagcgc 90 0 
gtataccact gtgatgaaga ggaatatcaa tcaaaatatt ctggctggcc gacaaatatt 960 

25 attaacattg aaattccttc cactagcgtt acagaaacag. ctctagtcaa catgaccagc 1020 
atggaatgtg gccttcccac ccttttgtct ttcacagccc gtgaactagc agatgggact 1080 
cttttcaaag caatagaaga ctatgaccca gatagcaaga ggatgttaca ccagttaaag 1140 
aagttgtatc atgatgaccc tgtttttaat cctctgactc cttgggagag accacctata 1200 
aaaaatgtat tttgcatata tggtgctcat ctaaagacag aggttggtta ttactttgcc 1260 

30 ccaagtggca aaccttatcc tgataattgg atcatcacgg atatcattta cgaaactgaa 1320 
ggttccctcg tgtcaaggtc tggaactgtg gttgatggga acgctggacc tataactggg 1380 
gatgagacgg taccctatca ttcactctct tggtgcaaga attggctcgg acctaaagtt 1440 
aacataacaa tggctcccca gccagaacac gatggaagcg acgtacatgt ggaactaaat 1500 
gttgatcatg agcatgggtc agacatcata gctaacatga caaaagcacc aagggttaag 1560 

35 tacataacct tttatgaaga ctctgagagc attccgggga agagaaccgc agtctgggag 1620 
cttgataaaa gtgggtatta a 1641 



<210> 5 
<211> 546 
<212> PRT 
40 <213> Arabidopsis thaliana 

<400> 5 

Met Qly Ala Asn Ser Lys Ser Val Thr Ala Ser Phe Thr Val lie Ala 
15 10 15 

val Phe Phe Leu He Cys Gly Gly Arg Thr Ala Val Glu Asp Glu Thr 
45 20 25 30 



Glu Phe His Gly Asp Tyr Ser Lys Leu Ser Gly He He He Pro Gly 
35 40 45 
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Phe Ala Ser Thr Gin Leu Arg Ala Trp Ser He Leu Asp Cys Pro Tyr 
50 55 60 

Thr Pro Leu Asp Phe Asn Pro Leu Asp Leu Val Trp Leu Asp Thr Thr 
65 70 75 80 

5 Lys Leu Leu Ser Ala Val Asn Cys .Trp Phe Lys Cys Met Val Leu Asp 

85 90 95 

Pro Tyr Asn Gin Thr Asp His Pro Glu Cys Lys Ser Arg Pro Asp Ser 
100 105 110 

Gly Leu Ser Ala He Thr Glu Leu Asp Pro Gly Tyr He Thr Gly Pro 
10 115 120 125 

Leu Ser Thr Val Trp Lys Glu Trp Leu Lya Trp Cys Val Glu Phe Gly 
130 135 140 

He Glu Ala Asn Ala He Val Ala Val Pro Tyr Asp Trp Arg Leu Ser 
145 150 155 160 

15 Pro Thr Lys Leu Glu Glu Arg Asp Leu Tyr Phe Hi a Lys Leu Lys Leu 

165 170 ' 175 

Thr Phe Glu Thr Ala Leu Lys Leu Arg Gly Gly Pro Ser He Val Phe 
180 185 190 

Ala His Ser Met Gly Asn Asn Val Phe Arg Tyr Phe Leu Glu Trp Leu 
20 195 . 200 205 

Arg Leu Glu He Ala Pro Lys His Tyr Leu Lys Trp Leu Asp Gin His 
210 215 220 

He His Ala Tyr Phe Ala Val Gly Ala Pro Leu Leu Gly Ser Val Glu 
225 230 235 240 

25 Ala He Lys Ser Thr Leu Ser Gly Val Thr Phe Gly Leu Pro Val Ser 

245 250 255 

Glu Gly Thr Ala Arg Leu Leu Ser Asn Ser Phe Ala Ser Ser Leu Trp 
260 265 270 

Leu Met Pro Phe Ser Lys Asn Cys Lys Gly Asp Asn Thr Ser Trp Thr 
30 275 280 285 

His Phe Ser Gly Gly Ala Ala Lys Lys Asp Ly& Arg Val Tyr His Cys 
290 295 300 

Asp Glu Glu Glu Tyr Gin Ser Lys Tyr Ser Gly Trp Pro Thr Asn He 
305 310 315 320 



35 He Asn He Glu He Pro Ser Thr Ser Val Thr Glu Thr Ala Leu Val 

325 330 335 



Asn Met Thr Ser Met Glu Cys Gly Leu Pro Thr Leu Leu Ser 
340 345 350 



Phe Thr 
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Ala Arg Glu Leu Ala Asp Gly Thr Leu Phe Lys Ala He Glu Asp Tyr 
355 360 365 

Asp Pro Asp Ser Lys Arg Met Leu His Gin Leu Lys Lys Leu Tyr His 
370 375 380 

5 Asp Asp Pro Val Phe Asn Pro Leu Thr Pro Trp Glu- Arg Pro Pro- He 
385 390 395 400 

Lys Asn Val Phe Cys He Tyr Gly Ala His Leu Lys Thr Glu Val Gly 
405 410 415 

Tyr Tyr Phe Ala Pro Ser Gly Lys Pro Tyr Pro Asp Asn Trp He He 
10 420 425 430 

' Thr Asp He He Tyr Glu Thr Olu Gly Ser Leu Val Ser Arg Ser Gly 
435 440 445 

Thr Val Val Asp Gly Asn Ala Gly Pro He Thr Gly Asp Glu Thr Val 
450 455 460 

15 Pro Tyr His Ser Leu Ser Trp Cys Lys Asn Trp Leu Gly Pro Lys Val 
465 470 475 480 

Asn He Thr Met Ala Pro Gin Pro Glu His Asp Gly Ser Asp Val His 
485 490 49S 

Val Glu Leu Asn Val Asp His Glu His Gly Ser Asp He He Ala Asn 
20 500 505 510 

Met Thr Lys Ala Pro Arg Val Lys Tyr He Thr Phe Tyr Glu Asp Ser 
515 520 525 

Glu Ser He Pro Gly Lys Arg Thr Ala Val Trp Glu Leu Asp Lys Ser 
530 535 540 

25 Gly Tyr 
545 



<210> 6 
<211> 1608 
<212> UNA 
30' <213> Arabidopsis thaliana 

<400> 6 

atgtctctat tactggaaga gatcattaga tcagtagagg ctttgctgaa gctcagaaat 60 

cggaatcaag aaccctatgt tgacccgaat ctaaacccgg ttcttctagt tccaggaatc 120 

gctggatcaa ttctaaacgc cgttgatcat gagaacggga acgaagaacg tgtttgggtt 180 

35 aggatctttg gtgctgatca tgagtttcga acaaagatgt ggtctcgatt tgatccttca 240 

actggtaaaa cgatatctct tgatccaaaa acgagtattg ttgttcctca agacagagct 300 

gggctacatg caattgatgt cttagaccct gatatgattg ttggccgtga gtctgtgtac 360 

tatttccatg agatgattgt tgagatgatc ggatggggat ttgaagaagg gaaaaccctt 420 

tttggttttg gttatgattt ccgccaaagc aacagactgc aggaaacgtt ggaccagttt 480 

40 gctaaaaagt tggaaactgt ttataaagcc tcaggagaga agaagattaa tgttattagt 540 

cattctatgg gaggactatt ggtgaaatgt ttcatgggtc tgcatagtga tatattcgag 600 
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aagtatgtac agaattggat tgctattgct gctccatttc gaggtgctcc tggatatatc 660 
acatcgactt tattgaatgg aatgtcgttt gtcaatggtt gggaacagaa ctttttcgtc 720 
tctaagtgga gcatgcatca gctgcttatt gagtgtccat ccatatatga gctgatgtgt 780 
tgtccgtatt ttaaatggga gctccctccc gtcttagagc tgtggagaga gaaagagagc 840 
5 aatgatggag ttggaacctc tgatgttgtt cttgagtctt accgtagcct ggagagcctt 900 
gaagttttta cgaaatctct ttcgaataat acagctgatt attgtggaga gtcgatcgat 960 
cttcctttta actggaagat catggagtgg gctcacaaaa ccaagcaagt attagcctct 1020 
gccaagctgc ctccgaaagt taaattctat aacatatatg ggaccaatct agaaacccct 1080 
catagtgttt gctatgggaa tgagaagatg cccgttaaag,atctaacgaa tctaagatac 1140 

10 ttccagccga catatatatg cgtggatggt gatggcacag tcccgatgga atctgccatg 1200 
gcggatgggc ttgaagcagt agcaagagtt ggagtccctg gtgagcaccg aggaatcctc 1260 
aacgatcacc gtgtcttccg aatgctcaaa aaatggctaa atgtaggcga accagacccg 1320 
ttctacaacc cagtaaacga ttatgtcatc cttcccacca catatgaatt tgagaaattc 1380 
catgagaatg gactcgaggt tgcttccgtg aaagaatcgt gggacatcat atcagatgac 1440 

15 aacaatatcg gcacaaccgg gtcaaccgtg aactccatat cagtctctca acctggagat 1500 
gatcaaaacc ctcaagctga agctcgtgca accttaaccg tccaaccaca aagcgatggt 1560 
agacaacatg tagagctcaa tgctgtaagt gtctctgttg atgcataa 1608 



<210> 7 
<211> 535 
20 <212> PRT 

<213> Arabldopsls thaliana 

<400> 7 

Met Bex Leu Leu Leu Glu Glu He He Arg Ser Val Glu Ala Leu Leu 
15 10 15 

25 Lys Leu Arg Asn Arg Aan Gin Glu Pro Tyr Val Asp Pro Asn Leu Asn 
20 25 30 

Pro Val Leu Leu Val Pro Gly He Ala Gly Ser He Leu Asn Ala Val 
35 40 45 

Asp His Glu Asn Gly Asn Glu Glu Arg Val Trp Val Arg He Phe Gly 
30 50 55 60 

Ala Asp His Glu Phe Arg Thr Lys Met Trp Ser Arg Phe Asp Pro Ser 
65 70 75 80 

Thr Gly Lys Thr He Ser Leu Asp Pro Lys Thr Ser He Val Val Pro 
85 90 95 

35 Gin Asp Arg Ala Gly Leu His Ala He Asp Val Leu Asp Pro Asp Met 
100 ' ' 105 110 

He Val Gly Arg Glu Ser Val Tyr Tyr Phe His Glu Met He Val Glu 
115 120 125 

Met He Gly Trp Gly Phe Glu Glu Gly Lys Thr Leu Phe Gly Phe Gly 
40 130 135 140 

Tyr Asp Phe Arg Gin Ser Asn Arg Leu Gin Glu Thr Leu Asp Gin Phe 
145 150 155 160 
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Ala Lys Lye Leu Glu Thr Val Tyr Lys Ala Ser Gly Glu Lys Lys lie 
165 170 175 

Asn Val He Ser His Ser Met Gly Gly Leu Leu Val Lys Cys Phe Met 
180 185 190 

5 Gly Leu His Ser Asp He Phe Glu Lys Tyr Val Gin Asn Trp He Ala 
195 200 205 

He Ala Ala Pro Phe Arg Gly Ala Pro Gly Tyr He Thr Ser Thr Leu 
210 215 220 

Leu Asn Gly Met Ser Phe Val Asn Gly Trp Glu Gin Asn Phe Phe Val 
10 225 230 235 240 

Ser Lys Trp Ser Met His Gin Leu Leu He Glu Cys Pro Ser He Tyr 
245 250 255 

Glu Leu Met Cys Cys Pro Tyr Phe Lys Trp Glu Leu Pro Pro Val Leu 
260 265 270 

15 Glu Leu Trp Arg Glu Lys Glu Ser Asn Asp Gly Val Gly Thr Ser Asp 
275 280 285 

Val Val Leu Glu Ser Tyr Arg Ser Leu Glu Ser Leu Glu Val Phe Thr 
290 295 300 

Lys Ser Leu Ser Asn Asn Thr Ala Asp Tyr Cys Gly Glu Ser He Asp 
20 305 310 315 320 

Leu Pro Phe Asn Trp Lys He Met Glu Trp Ala His Lys Thr Lys Gin 
• 325 330 335 

Val Leu Ala Ser Ala Lys Leu Pro Pro Lys Val Lys Phe Tyr Asn He 
340 345 350 

25 Tyr Gly Thr Asn Leu Glu Thr Pro His Ser Val Cys Tyr Gly Asn Glu 
355 360 365 

Lys Met Pro Val Lys Asp Leu Thr Asn Leu Arg Tyr Phe Gin Pro Thr 
370 375 380 

Tyr He Cys Val Asp Gly Asp Gly Thr Val Pro Met Glu Ser Ala Met 
30 385 390 395 400 

Ala Asp Gly Leu Glu Ala Val Ala Arg Val Gly Val Pro Gly Glu His 
405 410 415 

Arg Gly He Leu Asn Asp His Arg Val Phe Arg Met Leu Lys Lys Trp 
420 425 430 

35 Leu Asn Val Gly Glu Pro Asp Pro Phe Tyr Asn Pro Val Asn Asp Tyr 
435 440 445 

Val He Leu Pro Thr Thr Tyr Glu Phe Glu Lys Phe His Glu Asn Gly 
450 , 455 460 
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Leu Glu Val Ala Ser Val Lys Glu Ser Trp Asp lie lie Ser Asp Asp 
465 470 475 480 

Asn Asn lie Gly Thr Thr Gly Ser Thr Val Asn Ser lie Ser Val Ser 
485 490 495 

5 Gin Pro Gly Asp Asp Gin Asn Pro Gin Ala Glu Ala Arg Ala Thr Leu 
500 505 510 

Thr Val Gin Pro Gin Ser Asp Gly Arg Gin His Val Glu Leu Asn Ala 
515 520 525 

Val Ser Val Ser Val Asp Ala 
10 530 535 



<210> 8 
<211> 1344 . 
<212> DNA 

<213> Arabidopsis thaliana 
15 <400> 8 

atgggctgga ttccgtgtcc gtgctgggga accaacgacg atgaaaacgc cggcgaggtg 60 
gcggatcgtg atccggtgct tctagtatct ggaattggag gctctattct gcattctaag. 120 
aagaagaatt caaagtctga aattcgggtt tgggtccgaa tatttctagc taaccttgcc 180 
tttaagcaga gcctctggtc tctctataat cccaaaactg gttatacaga gccgttggat 240 

20 gataatattg aagtattggt ccctgatgat gaccatggac tctatgcaat tgacattcta 300 
gatccctctt ggtttgtaaa gctttgtcac ttgacggagg tttatcactt tcacgatatg 360 
atagaaatgc tggttggatg cggttataag aaggggacta cattattcgg ttatggttac 420 
gatttccgtc aaagcaatag gatcgatcta cttatactag gtctgaagaa gaagctggaa 480 
actgcatata aacgttcagg ggggagaaaa gtcactatca tctcccattc aatgggagga 540 

25 cttatggttt catgtttcat gtatctccat ccggaggcat tttccaagta tgtaaataaa 600 
tggattacaa ttgcaacacc tttccaagga gcaccagggt gcatcaatga ttcaatcttg 660 
actggagtgc aatttgtgga agggttagaa agtttctttt ttgtgtcacg ttggacgatg 720 
caccaactgt tggtcgaatg cccatctata tatgagatga tggcaaatcc agactttaag 780 
tggaaaaagc aaccagagat tcgagtttgg cgtaagaaat ctgaaaacga cgttgatact 840 

30 tctgtagaac tggaatcatt tggcttaatc gagagtattg atctattcaa cgatgcatta 900 
aaaaataacg agctaagcta tggtgggaat aaaatagctt tgccctttaa ctttgctatc 960 
ctcgactggg ctgctaagac aagagaaatt ctcaacaaag cgcaacttcc tgatggagtg. 1020 
tccttctata acatatatgg agtgtcactt aatacaccct ttgatgtttg ttatggcaca 1080 
gagacttctc cgatagacga tttgtctgaa atatgtcaaa ctatgcctga gtatacatat 114 0 

35 gtagatggag atggaactgt ccctgctgaa tcagctgcag ctgctcagtt taaagcagtt 1200 
gctagcgtag gagtttcggg tagccaccgc gggcttctcc gtgatgaaag agtgtttgag 1260 
ctcattcaac aatggttagg agttgagccc aagaaggcta aacggaagca tttaaggact 1320 
cacaaagtag ttgattctgg ttaa 1344 



. <210> 9 
40 <211> 447 
<212> PRT 

<213> Arabidopsis thaliana 

<400> 9 

Met Gly Trp lie Pro Cys Pro Cys Trp Gly Thr Asn Asp Asp Glu Asn 
45 1 5 10 ' 15 
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Ala Gly Glu Val Ala Asp Arg. Asp Pro Val Leu Leu Val Ser Gly lie 
20 25 30 

Gly Gly Ser lie Leu His Ser Lys Lys Lys Asn Ser Lys Ser Glu lie 
35 40 45 

5 Arg Val Trp Val Arg lie Phe Leu Ala Asn Leu Ala Phe Lys Gin Ser 
50 55 60 

Leu Trp Ser Leu Tyr Asn Pro Lys Thr Gly Tyr Thr Glu Pro Leu Asp 
65 70 75 80 

Asp Asn He Glu Val Leu Val Pro Asp Asp Asp His Gly Leu Tyr Ala 
10 85 90 95 

He Asp He Leu Asp Pro Ser Trp Phe Val Lys Leu Cys His Leu Thr 
100 105 110 

Glu Val Tyr His Phe His Asp Met He Glu Met Leu Val Gly Cys Gly 
115 120 125 

15 Tyr Lys Lys Gly Thr Thr Leu Phe Gly Tyr Gly Tyr Asp Phe Arg Gin 
130 135 140 

Ser Asn Arg He Asp Leu Leu He Leu Gly Leu Lys Lys Lys Leu Glu 
145 150 155 160 

Thr Ala Tyr Lys Arg Ser Gly Gly Arg Lys Val Thr He He Ser Hie 
20 165 170 175 

Ser Met Gly Gly Leu Met Val Ser Cys Phe Met Tyr Leu His Pro Glu 
180 185 190 

Ala Phe Ser Lys Tyr Val Asn Lys Trp He Thr He Ala Thr Pro Phe 
195 200 205 

25 Gin Gly Ala Pro Gly Cys He Asn Asp Ser He Leu Thr Gly Val Gin 
210 215 220 

Phe Val Glu Gly Leu Glu Ser Phe Phe Phe Val Ser Arg Trp Thr Met 
225 230 235 240 

His Gin Leu Leu Val Glu Cys Pro Ser He Tyr Glu Met Met Ala Asn 
30 '245 250 255 

Pro Asp Phe Lys Trp Lys Lys Gin Pro Glu He Arg Val Trp Arg Lys 
260 265 . 270 

Lys Ser Glu Asn Asp Val Asp Thr Ser Val Glu Leu Glu Ser Phe Gly 
275 280 285 

35 Leu He Glu Ser He Asp Leu Phe Asn Asp Ala Leu Lys Asn Asn Glu 
290 295 300 



Leu Ser Tyr Gly Gly Asn Lys He Ala Leu Pro Phe Asn Phe Ala He 
305 310 315 320 
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Leu Asp Trp Ala Ala Lys Thr Arg Glu lie Leu Asn Lys Ala Gin Leu 
325 33a 335 

Pro Asp Gly Val Ser Phe Tyr Asn He Tyr Gly Val Ser Leu Asn Thr 
340 345 350 

5 Pro Phe Asp Val Cys Tyr Gly Thr Glu Thr Ser Pro He Asp Asp Leu 
355 360 365 

Ser Glu He Cys Gin Thr Met Pro Glu Tyr Thr Tyr Val Asp Gly Asp 
370 375 380 

Gly Thr Val Pro Ala Glu Ser Ala Ala Ala Ala Gin Phe Lys Ala Val 
10 385 390 395 400 

Ala Ser Val Gly Val Ser Gly Ser His Arg Gly Leu Leu Arg Asp Glu 
405 • 410 415 

Arg Val Phe Glu Leu He Gin Gin Trp Leu Gly Val Glu Pro Lys Lys 
420 425 430 

15 Ala Lys Arg Lys His Leu Arg Thr His Lys Val Val Asp Ser Gly 
435 440 445 



<210> 10 
<211> 3107 
<212> DNA 
20 <213> Arabidopsis thaliana 

<400> 10 

cctttttgat ctttcagctc aatgagcttt tctcaatttt ttgggggaac tgaatatgtg. 60 
aatttcaaag tttccacatc gagtttattc acacgtcttg aatttcgtcc atcctcgttc 120 
tgttatccag ctttgaactc ctcccgaccc tgctatggat atattaaaaa aaaagtgttt 180 

25 tgtgggttgc atctttgtta cgatctgcat cttcttcttt cggctcagtg ttcatgtttt 240 
tgctatggta gagatgggca atgttattgt tgatggtaac agtggtatag ttgatagtat 300 
cttaactaat caattatctc tttgattcag gcctctatgt tgggtggaac acatgtcact 360 
tgacaatgaa actgggttgg atccagctgg tattagagtt cgagctgtat caggactcgt 420 
ggctgctgac tactttgctc ctggctactt tgtctgggca gtgctgattg ctaaccttgc 480 

30 acatattgga tatgaagaga aaaatatgta catggctgca tatgactggc ggctttcgtt 540 
tcagaacaca gaggttcttt tctcatcgtt ctttctatta ttctgttcca tgttacgttt 600 
ctttcttcat tacttaaggc ttaaatatgt ttcatgttga attaataggt acgtgatcag 660 
actcttagcc gtatgaaaag taatatagag ttgatggttt ctaccaatgg tggaaaaaaa 720 
gcagttatag ttccgcattc catgggggtc ttgtattttc tacattttat gaagtgggtt 780 

35 gaggcaccag ctcctctggg tggcgggggt gggccagatt ggtgtgcaaa gtatattaag 84 0 
gcggtgatga acattggtgg accatttctt ggtgttccaa aagctgttgc agggcttttc 900 
tctgctgaag caaaggatgt tgcagttgcc aggtattgaa tatctgctta tacttttgat 960 
gatcagaacc ttggctctgg aactcaaagt tattctacta aatatcaatt ctaataacat 1020 
tgctatatta tcgctgcaac tgacattggt tgattatttt gctgcttatg taactgaaac 1080 

40 tctcttgaga ttagacaaat gatgaattga taattcttac gcattgctct gtgatgacca 1140 
gtttcttagc ttcgacgata acatttgtca tactgtcttt tggagggcat tgaattttgc 1200 
tatggaaagc gctggagctt ccatgcttgc attctttacc aattagcatt attctgcttc 1260 
tttcaatttt cttgtatatg catctatggt cttttatttc ttcttaatta aagactcgtt 1320 
ggagtagttg ctctattagt cgcttggttc cttaatatag aactttactt tcttcgaaaa 1380 

45 ttgcagagcg attgccccag gattcttaga caccgatata tttagacttc agaccttgca 1440 
gcatgtaatg agaatgacac gcacatggga ctcaacaatg tctatgttac cgaagggagg 1500 
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10 



15 

} 

20 



25 



30 

) 

35 



40 



45 



tgacacaata tggggcgggc ttgattggtc accggagaaa ggccacacct gttgtgggaa 1560 
aaagcaaaag aacaacgaaa cttgtggtga agcaggtgaa aacggagttt ccaagaaaag 1620 
tcctgttaac tatggaagga tgatatcttt tgggaaagaa gtagcagagg ctgcgccatc 1680 
tgagattaat aatattgatt ttcgagtaag gacatataaa tcataataaa ccttgtacat 174 0 
tttgtgattg tatgatgaat atctgtacat tttatctggt gaagggtgct gtcaaaggtc 1800 
agagtatccc aaatcacacc tgtcgtgacg tgtggacaga gtaccatgac atgggaattg 1860 
ctgggatcaa agctatcgct gagtataagg tctacactgc tggtgaagct atagatctac 1920 
tacattatgt tgctcctaag atgatggcgc gtggtgccgc tcatttctct tatgggattg 1980 
ctgatgattt ggatgacacc aagtatcaag atcccaaata ctggtcaaat ccgttagaga 2040 
caaagtaagt gatttcttga ttccaactgt atccttcgtc ctgatgcatt atcagtcttt 2100 
ttgttttcgg tcttgttgga tatggttttc agctcaaagc ttacaaagct gtttctgagc 2160 
ctttctcaaa aaggcttgct cagttatatt gaggtgctaa agttgataca tgtgactctt 2220 
gcttataaat cctccgtttg gtttgttctg ctttttcaga ttaccgaatg ctcctgagat 2280 
ggaaatctac t'cattatacg gagtggggat accaacggaa cgagcatacg tatacaagct 2340 
taaccagtct cccgacagtt gcatcccctt tcagatattc acttctgctc acgaggagga 2400 
cgaagatagc tgtctgaaag caggagttta caatgtggat ggggatgaaa cagtacctgt 2460 
cctaagtgcc gggtacatgt gtgcaaaagc gtggcgtggc aagacaagat tcaacccttc 2520 
cggaatcaag acttacataa gagaatacaa tcactctccg ccggctaacc tgttggaagg 2580 
gcgcgggacg cagagtggtg cccatgttga tatcatggga aactttgctt tgatcgaaga 2640 
tatcatgagg gttgccgccg gaggtaacgg gtctgatata ggacatgacc aggtccactc 2700 
tggcatattt gaatggtcgg agcgtattga cctgaagctg tgaatatcat gatctcttta 2760 
agctgtcctg tcagcttatg tgaatccaat actttgaaag agagatcatc atcaattcat 2820 
catcatcgtc atcatcatga tgctcaactc acaaagaagc ctgagaatga tactttggtg 2880 
cgaaattctG aatacctctt taatattctt attgaatgta aattatacaa tcctatctaa 2940 
tgtttgaacg ataacgcaaa acttgctgcg ccatgtttgt ttgtcttgtc aaaagcatca 3000 
atttgtgggt tatacgtagt gtagaggatg attcaaattt gtgataaatt tggtaatcaa 3060 
agttaattct gaaaatgcaa caccacatga actatgtcac taaggcc 3107 



<210> 11 
<211> 1680 
<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> unsure 
<222> (694) 
<223> n»un]cown 



cgcataaggt gttcgagtgt ttgcagcttg agaagttccg agtccaagag acctggagcc 60 
aaagatctga accataaaaa tgaccaatca aaatccatta agccaattca aatattcact 120 
aaaaatgtta tagttctcat gaatactaac ataacaagtg aaagtaaatt taaaaatgtt 180 
catggaccta acctggcgta acggtatgtc tttgccttca gcagaaagta aattactgac 240 
ggctttaggg acacctaaaa aggcgggtcc aatgttgacg acggatttga tgtgtttggc 300 
acaccaacct ggaccacccc caccgcctcc atcaggaaga ggtgtttcta cccatttaag 360 
gaagtgaagg aaatagatag cccccattga atgcggaacc accacaactt tcttaaaccc 420 
attggtggca tacattagct cgattttgct cttcagtcta cttaacgatt ggtcacgtac 480 
ctgctcggtt tcaatccaaa aactatagat tagtccaaag ctctacaaca atatgtaatt 540 
acatacacta aagtagctaa tcatggaggt cttatagtat atcattatca tcattctcta 600 
gaccaccagt gttgtcaatg tgatcatata ggtattaata acgactaatc tgagcatacc 660 
tcggtgttat ggaaagagag tctccaatca taangaggcc atgtgaaggt tcttgccttc 720 
atatccaatt tttgccaaat tctctatgag aactgcccaa gcaaagtagc atggtgcgaa 780 
atagtctgca gccactagtc ctgggactgc tcggacacgg attcccggtg gatcgagacc 840 
ggtctcactg tctagagata agtgctccaa ccagcacaat ggcctataaa tcaaattaca 900 
acaattaaac gaccaagtat acacttcaaa ctaattcaga attgagaaaa tcgaaatgct 960 
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aaccagaaaa tcatgtaaat caaaaaccgt 
gaatccatgt taaaaccata accaaaaata 
gttaaaactg atattctaaa tttagtaagt 
tttgaacata aatatatttt atagttttgt 
5 caaaacttac ggtttccatt tccaaattcg 
acgattaatt attaatttcc taaaaccatc 
aactagttgc aaaacaataa caaaacccaa 
aaatccaaag accatccata acaacaacag 
aaaccaagct taaccaatta tacatacccc 
10 gggtctataa aaatatctcg aaccgaacat 
tcggagaagc tagcacccca aagacgttta 
tcccaaagct cgagcccgcc ggttacaatc 



aacaatcaat atatatatat atattttcca 1020 
tatgaaaatt tagaaatact aaaataatat 1080 
tttaaaatgc aatgaaatcg tcattcatgt 1140 
aggacgattt tctacttcct atatagaaat 1200 
aatgacattt aaaaacatat cccaaaaatc 1260 
catcattact tagaaaataa tattttcata 1320 
agaaccatct ccacccatta accaaaatga 1380 
tataacacta cgtaaagcca attcaagaag 1440 
taaccagacg aaccaaacca atatctgacc 1500 
aacggtctaa tgtgttacct tctaagaatc 1560 
cgaaagagtc cttcagcgca aggccgacct 1620 
cccggaacaa gaatcaccgg atgaaacgcc 1680 



<210> 12 
<211> 264 
15- <212> DNA 

<213> Glycine max 

<220> 

<221> unsure 
<222> (39) 
20 <223> n»uxiknown 

<220> 

<221> unsure 
<222> (175) 
<223> naunknown 

25 <220> 

<221> unsure 
<222> (241) 
<223> nsunknown 

<400> 12 
30 ccaagaactc gatgattact 
ttccaccaac tctcttctct 
cacccctggt agattcatta 
ccccttatga ctttagatat 
ngttcctcaa agatctaaag 



tcaacactcc tggggttgng 
catctcaatc ctcgtctcaa 
caaaagcttg gctacgctga 
ggtctagctg ctgaaggtca 
aatt 



acccgggtcc ctcactttgg 60 
gcatatcacc ggatacatgg 120 
tggtgagact ctgtntggag 180 
cccttcacaa gtgggttcca 240 

264 



35 <210> 13 

<211> 273 
<212> DNA 
<213> Glycine max 

<220> 
40 <221> unsure 
<222> (12) 
<223> nsunknown 

<220> 

<221> unsure 
45 <222> (33) 

<223> n«unknown 



r*rWMr». .tun n««»iAf»Ain i ^ 
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<220> 

<221> unsure 
<222> (252) 
<223> n^unknown 

<220> 

<221> unsure 
<222> (265) . . (266) 
<223> n»unknown 

<220> 

<221> unsure 
<222> (272) 
<223> n«unkhovni 

<400> 13 

ccaacatctg anaggggtag agtaggtatt tcnatctatt atccatattg tgatgaagaa 60 

ggaacaagaa gagggtctca agattgaggt tgctacactc acagttacag tagttgttgt X20 

gatgctgtca ttgctatgca catgtggggc aagcaacctc gaccctttga ttctaatacc 180 

aggtaacgga gggaaccaac tagaagcaag gttgaccaat cagtacaagc cctctacttt 240 

catctgcgat cntggtaccc tctcannaag ana 273 



<210> 14 

<211> 419 

<212> DNA 

<213> Glycine max 

<220> 

<22l> unsure 
<222> (99) 
<223> n»unknown 

<220> 

<22l> unsure 
<222> (346) 
<223> n-unknown 

<220> 

<221> unsure 
<222> (352) 
<223> n»unknown 

<220> 

<221> unsure 
<222> (392)^ 
<223> nssunknown 

<220> 

<22X> unsure 



<222> (405) 
<223> n^unknovin 
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<220> 

<221> unsure 
<222> (418) 
<223> n»unknown 



5 <400> 14 

gctgcatatg attggagaat agcatttcag 
cggataaaaa gcaacataga acttatggtt 
attccacatt caatgggggt cttgtacttc 
gctccaatgg gtggtggggg aggaccagat 
10 aacattggtg gaccattttt aggtgttccc 
gccaaggata ttgctgttgc caggacgata 
ccgcattcaa acccttgcaa catgtaatga 



aacactgagg tgagggatca aacactaagt 60 
gctactaang gtggaaataa ggcagttatt 120 
ctacatttta tgaaatgggt tgaagcacca 180 
tggtgctcca aatatataaa ggcagttgta 240 
aaggctatag cagggctatt ctcagctgag 300 
gctccaggat ttttanataa cnatctgttt 360 
anatgaaccc gttcnttggg actcaacna 419 



<210> 15 
<211> 272 
15 <212> DNA 

<213> Glycine max 

<220> 

<221> unsure 
<222> (1) . . (272) 
20 <223> n=unknovm 



<400> 15 

tganttgatc ntgngaagtn attctgtgta 
gtaagtgang ggtntgaaga gggaaagacg 
agcaacaggt tgcaggaaac aatggatcgg 
25 gccgcaggnn ggaagacaat aaacattata 
atgtttcntg tgcctgcaaa gcgatatttt 



ttanttccat gacatgaccg ttnnag9.tnc 60 
ctttttggtn ttngatatga ttttcgccaa 120 
ttggctgcna agttagaatc aanttataat 180 
nttcattcta tgggcggtct tttccnngan 240 
ga 272 



<210> 16 

<211> 237 

<212> DNA 

30 <213> Glycine max 

<220> 

<221> unsure 

<222> (1) . . (237) 

<223> n=sunknovm 



35 <400> 16 

gattttcgcc aaagcaacag 
tcaatntata atgcngcagg 
cttttggtga aatgnttcat 
tgggttgcaa tttgtgcgcc 



gttgcaggaa acaatggatc 
agggaagana ataaacatta 
gtgcctgcaa agcgatattt 
attccagggt gcaccaggaa 



ggttggctgc aaagttagaa 60 
taactcattc tatgggcggt 120 
ttgagaaata tgttaagaat 180 
ccatcaattc naccttt 237 



40 <210> 17 
<211> 244 
<212> DNA 
<213> Glycine max 
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<220> 

<221> unsure 
<222> (1) . . (244) 
<223> n^^unknown 



5 <400> 17 

gattttcgcc aaagcaacag gttgcaggaa 
tgcaatttat aatgctgcag gagggaagaa 
tcttttggtg aaatgtttca tgtgcctgca 
ttgggttgca atttgtgcgc cattccaggg 
10 aaat 



acaatggatc ggtnggctgc aaagttagaa 60- 
aataaacatt ataactcatt ctatgggcgg 120 
aagcgatatt tttgagaaat atgttaagaa 180 
tgcaccagga accatcaatt ctaccttttt 240 

244 



<210> 18 

<211> 263 

<212> DNA 

<213> Glycine max 

15 <400> 18 

gatgaaacta aaccgtgggc gactaagctt 
gttcgttgct tcatagaaga ggtcattggt 
ggaggattgg ttgcattgta ttttgcggca 
ttgcttaagc aacacctttt tgggggtttc 

20 cgaaaatatt tccatgggcc gga 



gtttactcgg ttgatttatg gcaagatcaa 60 
gaaccagtct atcttgtggg caactcacta 120 
aacaaccctc atttagtgaa aggtgtcgca 180 
tgccaaatcc cataaaaagt ccaagactag 240 

263 



<210> 19 
<211> 311 
<212> DNA 
<213> Zea mays 



25 <220> 

<221> unsure 
<222> (1) , , (311) 
<223> nsunknown 

<400> 19 

30 cggacgctgg ncatgttcgg agccccctac gacttccgct acgcgccgcc gtcccccggc 60 

cagacgtccg aggtgtactc ccgctacttc aaggagctga tggagctggt cgaggccgcg 120 

agcgagagga cccggaagaa ggccgtcatc ctcggccaca gcttcggcgg catggtcgcg 180 

ctcgagttcg tccggaacac tccgccggcg tggcggcgcg agcacatcga gcgcctcgtc 240 

ctggtcgcgc cgacgctccc cggcgggttc ctggagccgg tgcgcaactt cgcgtccggg 300 

35 acggacatcc t 311 



<210> 20 
<:211> 1155 
<212> DNA 
<213> Zea mays 

40 <400> 20 

tcgacccacg cgtccggcca caagaaccct 
agagccgcac tggaagacat gggataccga 
gacttccgct acgcgccgcc gtcccccggc 
aaggagctga tggagctggt cgaggccgca 

45 ctcggccaca gcttcggcgg catggtcgcg 



ctcaagtcag actggtgcct cggaaagctg 60 
gacggagaca ccatgttcgg agccccctac 120 
cagacgtccg aggtgtactc ccgctacttc 180 
agcgagagga cccggaagaa ggccgtcatc 240 
ctcgagttcg tccggaacac tccgccggcg 300 
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tggcggcgcg agcacatcga gcgcctcgtc 
ctggagccgg tgcgcaactt cgcgtccggg 
ccgctggcca cgcgagccat gtgrag.ragc 
ccggccgtgt tcgggcgcct gcaggcgccg 
5 gcgtccgcgc acgacatgga gcgcttcctc 
cccttcagga gacgggccgt ccccaagatg 
acgtacatca gcggggtcgg caacaggacg 
ttcgacgcgg ccccggaggt ggcggcgtac 
agcgtcttgg cgtttgagaa ggagatgcgt 

10 tccatcaaga tcgataaggc ccagcattct 
agggtcattc aagaaattgt tgaggccaat 
gtcatgtatg cattaccgag ctgtgggggc 
tccgtgctta aaacggtcgt ggtgtggtct 
caattgcttg cctcaaaaaa aaaaaaaaaa 

15 aaaaaaaaar gggcg 



ctggtcgcgc cgacgctccc cggcgggttc 3 60 
acggacatcc tratacgtgcc agcgacgacg 420 
ttcgagagcg ccatcgtgaa cttcccgtcg 480 
ctcgtggtca ccagggagcg gaactactcc 540 
gccgccgtcg gctccggcga ggccgcggag 600 
ggcagcttcg cggcgccgat ggtgcccatg 660 
ccgctgcggc tggtgttctg gggcgacgac 720 
ggggacggag atggcaagat caatttgatc 780 
cggcagccgg agcagaagaa gcagttcaaa 840 
acgatcgtca cggatgattt tgccctgcac 900 
aatcagaaga ttccatccta aattcttcat 960 
caatagtggg ttgggagttg ggacatcggt 1020 
caattcaatc gattagttat ttgttaacgt 1080 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1140 

1155 



<210> 21 
<211> 328 

<2i2> mh 
<213> Zea mays 

20 <400> 21 

gttggaatgc tcttcaactt tcttacgctc 
ctggcttaaa tgcatgctgc ttgaacccta 
aaggcctgat agtggtcttc tgcaattaca 
ctctcttcag tatggaaaga atgggtcaaa 

25 gcaattatcg ctgttccgta tgattggaga 
ctgtactttc acaattaaac aggatcag 



atttacattg cttttctctg cggtaaattg 60 
taatcagata gaccatcccg aatgcaagtc 120 
gagctggacc ctggttatat aacaggtcct 180 
tggtgtgtag agtttggcat tgaagctaat 240 
ctgcccccat caatgcttga ggagagagat 300 

328 



<210> 22 
<211> 356 
<212> DNA 
30 <:213> Zea mays 



<400> 22 

gtctttctgc aattacagag ctggaccctg 
tcagtatgga aagaatgggt caaatggtgt 
atcgctgttc cgtatgattg gagactgccc 
35 tttcacaaat taaagtttgt aacacttgcc 
catctgttag gatcagtaag agttaatggc 
cagatcccac aagcatacgg ttaccgccaa 



<210> 23 
<211> 1552 
40 <212> DNA 

<213> Zea mays 



gttatataac aggtttcagg tcctctctct 60 
gtagagtttg gcattgaagc taatgcaatt 120 
ccatcaatgc ttgaggagag agatctgtac 180 
tcaacttgtt atgaagcaac caatgctata 240 
ccatgacgga ttcaggttcc tgctcaccaa 300 
tgcctgcagt tggacagtac caaccc 356 



<400> 23 

tcgacccacg cgtccgcaga catgatcatt 
atgatagtgg aaatgattaa atggggatat 
tatgatttcc gtcaaagcaa caggctctca 
gagtcagtgt acacagcttc tggtggaaag 
ggattacttg tgaaatgttt catctcactg 



ggtgatgaca ctgtgtacta ctatcatgac 60 
caagaaggaa aaactctctt tggatttggt 120 
gagacacttg acagattttc taaaaagctg 180 
aagatcaatc tcattactca ttcaatgggg 240 
cacagtgata tatttgaaaa atatgtcaar 300 
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agttggatcg caattgctgc accattccar ggtgcccctg ggtamataac taccaktytg 360 
ctgaatggaa tgtcttttgt craaggatgg gaaycaagat tctttatttc caaawkgkgt 420 
atgcaacaat tgctacttga gtgcccatca atctatgagk tgctgscaam ccctaacttt 480 
ccagtggaga gacatcccac tgctacagat ttggagagag aatttggata mcagtggcaa 540 
5 gaaaagtgcc ctgttagagt cgtatgagcc tgaggaagca ataaagatga ttaaagaggc 600 
tctttccagt aatgagatca ttgctgatgg catgcatatt ccggtgcccc ttaatttgga 660 
tatattgaat tgggcaaaga aacttatgat cttttatgca gtacaaagct tccggaatca 720 
gtgaaattct acaacattta tgggattgat tatgatactc cacatactgt ctgctatggc 780 
agtgaacagc agccggtttc aagtcttagt agcctcttat atgctcaggg aaaatacgtc 840 

10 tatgttgatg gcgacggatc tgttcccgca gaatcagcaa aggctgacgg atttaatgca 900 
gtggcaaggg ttggggttgc tcctgaccac cggggaatcg tgtgcagtcg ccgcgcgttc 96.0- 
cggatcgtcc agcactggct gcacgccgga gaacctgacc cgttctacga cccgctgagc 1020 
gactatgtca tactcccaac acgcttacga aatcgagaag catcgtgaga aacacgggga 1080 
tgtcacgtca gtagcggagg actgggagat catctcccct aacgacggca agaccatrrg 1140 

15 gccaggcgag cttcctccta tggtcagcac actgaccacg agccgggaag gcaaggaggg 1200 
agcactggaa gaggcgcatg ccaccgtggt cgttcacccg gagaagaagg gacggcagca 1260 
tgtgcaagtt agggctgtgg gtgtcagcca tggtggctaa agccgtagga gccacgttgg 1320 
ttgtctactc tatctagcag tagcagctat acctctgtgc acgcactgta aaattggatg 1380 
tacatatatg gctatgacct ctgtagggat ctggttttag aagtataaat gggcaccctg 1440 

20 cctgcttgta aatgttcaga accgaaaaca caggccctgt tctttttttt cctttttaaa 1500 
aaaaataaaa agatggtaaa ggattccatt aaaaaaaaaa aaaaaaaagg eg 1552 



<210> 24 
<211> 227 
<212> DNA 
25 <213> Zea mays 

<220> 

<221> unsure 

<222> (1) . . (227) 
<223> n=unknovm 

30 <400> 24 

ttggttatga tttccgtcaa agcaacaggc 
agctggagtc agtgtacaca gcttctggtg 
tggggggatt acttgtgaaa tgtntcatct 
tcaagagttg gntcgcaatt gcngcaccat 



tctcagagac acttgacaga ttttctaaaa 60 
gaaagaagat caatctcatt actcattcaa 120 
cactgcacag tgatatatnt gaaaaatatg 180 
tccaaggtgc ccctggg 227 



35 <210> 25 
<211> 1587 
<212> DNA 
<213> Zea mays 

<220> 
40 <221> unsure 

<222> (1) . . (1587) 
<223> n=un)cnown 

<400> 25 

ggagattgtc gtgccggagg acgaccacgg cctgtttgcc atcgacattc ttgatccttc 60 

45 ctggtttgta gaactcgacc cacgcgtccg cccaccgtcc gggagattgt cgtgccggag 120 

gacgaccacg gcctgtttgc catcgacatt cttgatcctt cctggtttgt agaacttctc 180 

catctgtcta tggtgtatca cttccatgat atgattgata tgctcataaa ctgtggatat 240 

gagaaaggga ccacactatt tggatatggt tatgattttc gccaaagcaa caggatagac 300 
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aaagcgatgg ctggtttgag agcaaaactt 
aaagttaatt taatctcaca ttctatgggt 
aatcatgatg tattcactaa gtatgtcaac 
ggtgcccccg gatgcatcaa tgattctcta 
5 gagagcttct ttttcgtatc tagatgggca 
atctatgaaa tgttaccaaa tccagaattc 
tggcgtaaga accctgaaaa ggatggaact 
tgtgtgtcct tgttcgaaga agctttaagg 
gtagcactac cattcaatat gtcagtcttc 

10 gacaatgctg aattaccaga tactgtgagc 
actccatacg atgtatgcta tggctcagaa 
tgtcacacag tgccggcata cacttatgtg 
gcacgggctg atgggttctc tgcgaaagaa 
ctgctgtccg atgagaacgt attcaagctt 

15 aagtcagagt ggcgttgcgt gtctaaatcc 
tagttcttca ggaagactgt tattttggcc 
tctggtgatt gtattgtatg tctgcacgat 
gattagtgcc cttgtacgat gtatcattgg 
tttntttgnc cttccactct ggacntagta 

20 zmgtctgnnn nnnnnnnnnn nnnnnnnimn 
niummmnrm zinaaaaaaaa awgkgaagcc 
aaaaaaaaaa aaaaaaaaaa aaaaaaa 



<210> 26 
<211> 300 
25 <212> DNA 

<213> Zea mays 



gagacagctc ataagacctc tggagggaaa 360 
ggattgctag tacgctgctt catgtctatg 420 
aaatggattt gcattgcttg tccattccaa 480 
cttactggat tgcaatttgt ttatggtttt 540 
atgcaccaat tgcttgtcga atgcccatca 600 
aagtggaagg aaaaaccaat tattcaggtt 660 
gtggagcttg ttcaatatga agcaactgat 720 
aataatgagc tcacgtataa cggaaagaaa 780 
aaatgggcca ccaagactcg ccaaatccta 840 
ttttacaata tatacgggac atcttatgaa 900 
agctctccga ttggagattt gtcagaagtg 960 
gatggagatt gcacggttcc catagaatcg 1020 
agagttggcg tcaaggcgga ccaccgtggc 1080 
etcaagaaat ggctcggtgt gagcgagaag 1140 
tactccaaag tgacctaatt gggttgcctg 1200 
tttcctcctg aagagaagat gaaacaaaat 1260 
gtaaatctct gcaagctgca cggaacaagg 1320 
caggcatttn tttttgaacc tangggcata 1380 
aagaatatnt gaatcgacct tanttnnaan 1440 
nnnnnnnnnn nnnnnnnnnn nnnnnixnnnn 1500 
gntnntnntt tnaaaagxmt tttnnnaaaa 1560 

1587 



<220> 

<221> unsure 
<222> (!) . . (300) 
30 . <223> n»unknovm 



<400> 26 

gacaaagcga tggctggttt gagagcaaaa 
aaaaaagtta atttaatctc acattctatg 
atgaatcatg atgtgagttt tcatgttttc 
35 atgtcaattt cccccatttt ctaggtattc 
gcttgtccat tccaaggtaa cttatgggac 



cttgagacag ctcataagac ctctggaggg 60 
ggtggattgc tagtacgctg cttcatgtct 120 
tgtgtttttt ttgcttttgc ataaatatcc 180 
actangtatg tcaacaaatg gatttgcatt 240 
atttcaattg tttattanat natggggncc 300 



<210> 27 
<211> 1240 
<212> DNA 
40 <213> Zea mays 

<220> 

<221> unsure 

<222> (1) . . (1240) 
<223> nesunknown 

45 <400> 27 

tcgacccacg cgtccggttc ccagttccca ccgtgtagat ggttctggta taaaatgtat 60 

tgccatattt gtaacacaga ttactatata caggttcgtg atcaaacttt gagcagaata 120 

aagascaata ttgaactcat agwagsgaca aatggtggaa atagggtggt ggkmgatccc 180 
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acnactccat ggggtcnttn attttntgcn ttttacgnaa tggntcgaag ccctcctccg 240 
tgggggcagt gggtccgaac tggntgtaga accatataaa gctgtaatga atattggagg 300 
atctttctta ggagttccta aggctgttgc tgggcttttt ttcttctaag caaaagatgt 360 
tgccggttgc taggtataag taatgattca tttatttaaa gcaaaaggga atagcaaaag 420 
5 aatgaatatt attggatgct cgacaagctt gcggagcttt tgctcccaag ccatcttctg 4 80 
gacctcacaa gtccagggag tgcctgcctc tgatcctcat catcaggaac aggctcaagt 540 
atgcaccgac ggtaccgtga ggtcatttct atcctgatgc aacaccatgt acttgttgat 600 
ggcaaggtca ggactgacaa gacctaccct gctgggttca tggatgtcat ttccatccct 660 
aagacaaacg agaactacag gctgctttcg tcttcaccca atcagggatg aggatgccaa 720 

10 gttcaagctc tacaaggtga ggtctgttca gtttggccag aaagacatcc cctatctgaa 780 
cacctacgac gaccgcacca tccgctaccc cgacccgctc atcaaggcca acgacaccat 840 
caagatcgat ctggagacca acaagatcat ggacttcatc atgtttgacg tcggcaacgt 900 
ggtcatggtg atcggcagga ggaataccgg gcgtgtagga gtgatcaara taagggagaa 960 
gcataagggc aacttcgaga ccatccacgt gctgcttgra gctttttgct atgtctagtt 1020 

15 ttctcctatt tgttgtacag gaaaacatag aatgaaattc aaatttggtg gccacaaaag 1080 
tgtggagact tgatttcata taaagttagg cttaacatta gtgcaaacag ttgtatttta 1140 
gtttagattt agagtacact atgtatgcgt tgtttgacaa tgcttattta tgatatattg 1200 
aatggtactt atttatatta attaattaaa aaaaaaaaaa 1240 



<210> 28 
20 <211> 324 
<212> DNA 
<213> Zea mays 

<400> 28 

cgaatgctcc tgacatggaa atattttcca tgtacggagt aggcattcct actgaaaggg 60 

"25 catatgtcta taagttggcc ccacaggcag aatgttatat acctttccga attgacacct 120 

cggctgaagg cggggaggaa aatagctgct tgaaaggggg tgtttactta gccgatggtg 180 

atgaaactgt tccagttctt agtgcgggct acatgtgtgc aaaaggatgg cgtggcaaaa 240 

ctcgtttcaa ccctgccggc agcaagactt acgtgagaga atacagccat tcaccaccct 300 
ctactctcct ggaaggcagg ggca 324 



30 <210> 29 
<211> 254 
<212> DNA 
<213> Zea mays 

<400> 29 

35 gaataaagag caacattgaa ctcatggtag caacaaatgg tggaaatagg gtggtggtga 60 

tcccacactc catgggggtc ctctattttt tgcattttat gaaatgggtc gaagcacctc 12a 

ctcccatggg gggtggcggt ggtccagact ggtgtgagaa gcatattaaa gctgtaatga 180 

atattggagg acctttctta ggagttccta aggctgttgc tggccttttc tcatctgaag 240 
ccaaagatgt tgcc 254 



40 <210> 30 

<211> 518 

<212> DNA 

<213> Mus musculus 

<400> 30 

45 tggaggacaa cgcggggtct. gatacgactc actataggga atttggccct cgagcagtag 60 

attcggcacg atgggcacga ggactccatc atgttcctca agctttattc ctaccgggat 120 

gtcaacctgt ggtgccgcca gcgaagggtc aaggccaaag ctgtctctac agggaagaag 1.80 
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gtcagtgggg, ctgctgcgag caagctgtga gctatccaga caacctgacc taccgagatc 240 
tcgattactt catctttgct cctactttgt. gttatgaact caactttcct cggtcccccc 300 
gaatacgaga gcgctttctg ctacgacgag ttcttgagat gctctttttt acccagcttc 360 
aagtggggct gatccaacag tggatggtcc ctactatcca gaactccatg gaagcccttt 420 
5 caagagcttc tgcagttttg gagaccgcga gttctacaga gattggtgga atgctgagtc 480 
tgtcaccgac ttttggcaga actggaatat ccccgtgg 518 



<210> 31 
<211> 299 
<212> DNA 
10 <213> Mus musculus 

<400> 31 

ccatgatggc tcaggtccca ctggcctgga ttgtgggccg attcttccaa gggaactatg 60 
gcaatgcagc tgtgtgggtg acactcatca ttgggcaacc ggtggctgtc tcatgtatgt 120 
ccacgactac tacgtgctca actacgatgc cccagtgggt catgagctac tgccaaaggc 180 
15 agccctccct aacctgggcc tggagttctg gaggggttcc tggctgcctg cacactcctc 240 
ctagtctggg aggcctctct gcccctatgc gctactcctg ctcttgggga tggcatttg 299 



<210> 32 
<211> 1895 
<212> DNA 
20 <213> Artificial Sequence 

<220> 

<:223> Description of Artificial Sequence: Inferred cDNA 
sequence 

<220> 
25 <221> unsure 

<222> (1) (1895) 
<223> nts^unknown 

<400> 32 

gtctggtgtg atggggacag ggagggactt ccccttaccc agcactggtg ttggctgagg 60 

30 tgggtgctga gtctcagagc ttggcatgga gaccagacag ggctgggtct gcaagcctga 120 
ggctgccgcc ctgagctcgg gctgggacgt gcccagaggt gttgggagga tctggggtga 180 
gtaccctgtg gccaggacta aaggggctnc accctcctgt ccatccctcg cagatcttga 240 
gcaatgcccg gttatttctg gagaacctca tcaagtatgg catcctggtg gaccccatcc 300 
aggtggtttc tctgttcctg aaggatccct atagctggcc cgccccatgc ctggttattg 360 

35 cggccaatgt ctttgctgtg gctgcattcc aggttgagaa gcgcctggcg gtgggtgccc 420 
tgacggagca ggcgggactg ctgctgcacg tggccaacct ggccaccatt ctgtgtttcc 480 
cagcggctgt ggtcttactg gttgagtcta tcactccagt gggctccctg ctggcgctga 540 
tggcgcacac catcctcttc ctcaagctct tctcctaccg cgacgtcaac tcatggtgcc .600 
gcagggccag ggccaaggct gcctctgcag ggaagaaggc cagcagtgct gctgccccgc 660 

40 acaccgtgag ctacccggac aatctgacct accgcgatct ctactacttc ctcttcgccc 720 
ccaccttgtg ctacgagctc aactttcccc gctctccccg catccggaag cgctttctgc 780 
tgcgacggat ccttgagatg ctgttcttca cccagctcca ggtggggctg atccagcagt 840 
ggatggtccc caccatccag aactccatga agcccttcaa ggacatggac tactcacgca 900 
tcatcgagcg cctcctgaag ctggcggtcc ccaatcacct catctggctc atcttcttct 960 

45 actggctctt ccactcctgc ctgaatgccg tggctgagct catgcagttt ggagaccggg 1020 
agttctaccg ggactggtgg aactccgagt ctgtcaccta cttctggcag aactggaaca 1080 
tccctgtgca caagtggtgc atcagacact tctacaagcc catgcttcga cggggcagca 1140 
gcaagtggat ggccaggaca ggggtgttcc tggcctcggc cttcttccac gagtacctgg 1200 
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tgagcgtccc tctgcgaatg ttccgcctct 
cactggcctg gttcgtgggc cgctttttcc 
tgtcgctcat catcggacag ccaatagccg 
tcaactatga ggccccagcg gcagaggcct 
5 gccacctcac acccgctgcc agagcccacc 
ggcctggctg cacagcatcc tcctctggtc 
ctgtcctgca cccctcaggg atggcgacag 
agtcttgctg accctgcccc gggtccgagg 
tcagcctgcc aggggcctgg ggcctggtgg 
10 ccagagctgt gtggacagct gtcccaggac 
gggcgggcat ggccggtagg gggagtgcaa 
cccctaccta gaaaagctca gctcaggcgt 



gggcgttcac gggcatgatg gctcagatcc 1260 
agggcaacta tggcaacgca gctgtgtggc 1320 
tcctcatgta cgtccacgac tactacgtgc 1380 
gagctgcacc tgagggcctg gcttctcact 1440 
tctcctccta ggcctcgagt gctggggatg 1500 
ccagggaggc ctctctgccc ctatggggct 1560 
caggccagac acagtctgat gccagctggg 1620 
gtgtcaataa. agtgctgtcc agtgacctct 1680 
ggggtatggc cacacccaca agggcgagtg 1740 
ctgccgggga gcagcagctc cactgcagca 1800 
ggccaggcag acgcccccat tccccacact 1860 
cctct 1895 



<210> 33 
<211> 1766 
15 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Inferred cDNA 
sequence 

20 <400> 33 

cacgactggg ccgcgacgtg gtgcgggccg aagccatggg cgaccgcgga ggcgcgggaa 60 
gctctcggcg tcggaggacc ggctcgcggg tttccatcca gggtggtagt gggcccatgg 120 
tagacgaaga ggaggtgcga gacgccgctg tgggccccga cttgggcgcc gggggtgacg 180 
ctccggctcc ggctccggtt ccggctccag cccacacccg ggacaaagac cggcagacca 240 

25 gcgtgggcga cggccactgg gagctgaggt gccatcgtct gcaagactct ttgttcagct 300 
cagacagcgg tttcagcaat taccgtggta tcctgaattg gtgcgtggtg atgctgatcc 360 
tgagtaatgc aaggttattt ttagagaatc ttatcaagta tggcatcctg gtggatccca 420 
tccaggtggt gtctctgttt ctgaaggacc cctacagctg gcctgcccca tgcttgatca 480 
ttgcatccaa tatctttatt gtggctacat ttcagattga gaagcgcctg tcagtgggtg 540 

30 ccctgacaga gcagatgggg ctgctgctac atgtggttaa cctggccaca attatctgct 600 
tcccagcagc tgtggcctta ctggttgagt ctatcactcc agtgggttcc ctgtttgctc 660 
tggcatcata ctccatcatc ttcctcaagc ttttctccta ccgggatgtc aatctgtggt 720 
gccgccagcg aagggtcaag gccaaagctg tgtctgcagg gaagaaggtc agtggggctg 780 
ctgcccagaa cactgtaagc tatccggaca acctgaccta ccgagatctc tattacttca 840 

35 tctttgctcc tactttgtgt tatgaactca actttcctcg atccccccga atacgaaagc 900 
gctttctgct acggcgggtt cttgagatgc tctttttcac ccagcttcaa gtggggctga 960 
tccagcagtg gatggtccct actatccaga actccatgaa gcccttcaag gacatggact 1020 
attcacgaat cattgagcgt ctcttaaagc tggcggtccc caaccatctg atatggctca 1080 
tcttcttcta ttggcttttc cactcatgtc tcaatgctgt ggcagagctc ctgcagtttg 1140 

40 gagaccgcga gttctacagg gactggtgga atgctgagtc tgtcacctac ttttggcaga 1200 
actggaatat ccccgtgcac aagtggtgca tcagacactt ctacaagcct atgctcagac 1260 
tgggcagcaa caaatggatg gccaggactg ggg'tcttttt ggcgtcagcc ttcttccatg 1320 
agtacctagt gagcattccc ctgaggatgt tccgcctctg ggcattcaca gccatgatgg 1380 
ctcaggtccc actggcctgg attgtgaacc gcttcttcca agggaactat ggcaatgcag 1440 

45 ctgtgtgggt gacactcatc attgggcaac cggtggctgt gctcatgtat gtccacgact 1500 
actacgtgct caactatgat gccccagtgg gggcctgagc tactgccaaa ggccagccct 1560 
ccctaacctg ggcctggagt tctggagggc ttcctggctg cctgcacact cctcctagtc 1620 
tgggaggcct ctctgcccct atggggccta ctcctgctct tggggatggc acctgagtcc 1680 
agctggtatg agccagtgct gggagtctgt gctgaccagg ggctgaggat atcaataaag 1740 

50 agctatctaa aaaaaaaaaa aaaaaa 1766 
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<210> 34 
<211> 409 
<212> PRT 

<213> Homo sapiens 
5 <400> 34 

Arg Arg Ser Leu Leu Aap Glu Leu Leu Glu Val Asp His He Arg Thr 
1 5 10 15 

He Tyr His Met Phe He Ala Leu Leu He Leu Phe He Leu Ser Thr 
20 25 30 

10 Leu Val Val Asp Tyr He Asp Glu Gly Arg Leu Val Leu Glu Phe Ser 
35 40 45 

Leu Leu Ser Tyr Ala Phe Gly Lys Phe Pro Thr Val Val Trp Thr Trp 
50 55 60 

Trp He Met Phe Leu Ser Thr Phe Ser Val Pro Tyr Phe Leu Phe Gin 
15 65 70 75 80 

His Trp Arg Thr Gly Tyr Ser Lys Ser Ser His Pro Leu He Arg Ser 
85 90 95 

Leu Phe His Gly Phe Leu Phe Met He Phe Gin He Qly Val Leu Gly 
100 105 110 

20 Phe Gly Pro Thr Tyr Val Val Leu Ala Tyr Thr Leu Pro Pro Ala Ser 

115 120 125 

Arg Phe He He He Phe Glu Gin He Arg Phe Val Met Lys Ala His 
130 135 140 

Ser Phe Val Arg Glu Asn Val Pro Arg Val Leu Asn Ser Ala Lys Glu 
25 145 150 155 160 

Lys Ser Ser Thr Val Pro He Pro Thr Val Asn Gin Tyr Leu Tyr Phe 
165 170 175 

Leu Phe Ala Pro Thr Leu lie Tyr Arg Asp Ser Tyr Pro Arg Asn Pro 
180 185 190 

30 Thr Val Arg Trp Gly Tyr Val Ala Met Lys Phe Ala Gin Val Phe Gly 
195 200 205 

Cys Phe Phe Tyr Val Tyr Tyr He Phe Glu Arg Leu Cys Ala Pro Leu 
210 215 220 

Phe Arg Asn He Lys Gin Glu Pro Phe Ser Ala Arg Val Leu Val Leu 
35 225 230 235 240 

Cys Val Phe Asn Ser IJe Leu Pro Gly Val Leu He Leu Phe Leu Thr 
245 250 255 . 



Phe Phe Ala Phe Leu His Cys Trp Leu Asn Ala Phe Ala Glu Met Leu 
260 265 270 



wo 01/16308 



25 



PCTAJSOO/23863 



Arg Phe Gly Asp Arg Met Phe Tyr Lys Asp Trp Trp Asn Ser Thr Ser 
275 280 285 

Tyr Ser Asn Tyr Tyr Arg Thr Trp Asn Val Val Val His Asp Trp Leu 
290 295 300 

5 Tyr Tyr Tyr Ala Tyr Lys Asp Phe Leu Trp Phe Phe Ser Lys Arg Phe 
305 310 315 320 

Lys Ser Ala Ala Met Leu Ala Val Phe Ala Val Ser Ala Val Val His 
325 330 335 

Glu Tyr Ala Leu Ala Val Cys Leu Ser Phe Phe Tyr Pro Val Leu Phe 
10 340 345 350 

Val Leu Phe Met Phe Phe Gly Met Ala Phe Asn Phe He Val Asn Asp 
355 360 365 

Ser Arg Lys Lys Pro He Trp Asn Val Leu Met Trp Thr Ser Leu Phe 
370 375 380 

15 Leu Gly Asn Gly Val Leu Leu Cys Phe Tyr Ser Gin Glu Trp Tyr Ala 
385 390 395 400 

Arg Arg His Cys Pro Leu Lye Asn Pro 
405 



<210> 35 
20 <211> 409 

<212> PRT . 
<213> Mus muBCulus 

<400> 35 

Arg Oln Ser Leu Leu Asp Glu Leu Phe Glu Val Asp His He Arg Thr 
25 1 5 . 10 15 

He Tyr His Met Phe. He Ala Leu Leu He Leu Phe Val Leu Ser Thr 
20 25 30 

He Val Val Asp Tyr He Asp Glu Gly Arg Leu Val Leu Glu Phe Asn 
35 .40 45 

30 Leu Leu Ala Tyr Ala Phe Gly Lys Phe Pro Thr Val He Trp Thr Trp 
50 55 60 

. Trp Ala Met Phe Leu Ser Thr Leu Ser He Pro Tyr Phe Leu Phe Gin 
€5 70 75 80 

Pro Trp hist His Gly Tyr Ser Lys Ser Ser His Pro Leu He Tyr Ser 
35 85 90 95 

Leu Val His Gly Leu Leu Phe Leu Val Phe Gin Leu Gly Val Leu Gly 
100 105 110 
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Phe Val Pro Thr Tyr Val Val Leu Ala Tyr Thr Leu Pro Pro Ala Ser 
. 115 120 125 

Arg Phe He Leu He Leu Glu Gin He Arg Leu He Met Lys Ala His 
130 135 140 

5 Ser Phe Val Arg Glu Asn He Pro Arg Val Leu Asn Ala Ala Lys Glu 
145 150 155 160 

Lys Ser Ser Lys Asp Pro Leu Pro Thr Val Asn Gin Tyr Leu Tyr Phe 
165 170 175 

Leu Phe Ala Pro Thr Leu He Tyr Arg Asp Asn Tyr Pro Arg Thr Pro 
10 180 185 190 

Thr Val Arg Trp Gly Tyr Val Ala Met Gin Phe Leu Gin Val Phe Gly 
195 200 205 

Cys Leu Phe Tyr Val Tyr Tyr He Phe Olu Arg Leu Cys Ala Pro Leu 
210 215 220 

15 Phe Arg Asn He Lys Gin Glu Pro Phe Ser Ala Arg Val Leu Val Leu 
225 230 235 240 

Cys Val Phe Asn Ser He Leu Pro Gly Val Leu He Leu Phe Leu Ser 
245 250 255 

Phe Phe Ala Phe Leu His Cys Trp Leu Asn Ala Phe Ala Glu Met Leu 
20 260 265 270 

Arg Phe Gly Asp Arg Met Phe Tyr Lys Asp Trp Trp Asn Ser Thr Ser 

275 280 285 

Tyr Ser Asn Tyr Tyr Arg Thr Trp Asn Val Val Val His Asp Trp Leu 
290 295 300 

25 Tyr Tyr Tyr Val Tyr Lys Asp Leu Leu Trp Phe Phe Ser Lys Arg Phe 
305 310 315 320 

Lys Ser Ala Ala Met Leu Ala Val Phe Ala Leu Ser Ala Val Val His 
325 330 335 

Glu Tyr Ala Leu Ala He Cys Leu Ser Tyr Phe Tyr Pro Val Leu Phe 
30 340 345 350 

Val Leu Phe Met Phe Phe Gly Met Ala Phe Asn Phe He Val Asn Asp 
355 360 365 

Ser Arg Lys Arg Pro He Trp Asn He Met Val Trp Ala Ser Leu Phe 
370 375 380 

35 Leu Gly Tyr G3.y Leu He Leu Cys Phe Tyr Ser Gin Glu Trp Tyr Ala 
385 . 390 395 400 



Arg Gin His Cys Pro Leu Lys Asn Pro 
405 
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<210>. 36 
<211> 429 
<212> PRT 

<213> saccharomyces cerevisiae 

5 <400> 36 

Asp Lys Ala Asp Ala Pro Pro Gly Glu Lys Leu Glu Ser Asn Phe Ser 
15 10 15 

Gly lie Tyr Val Phe Ala Trp Met Phe Leu Gly Trp lie Ala He Arg 
20 25 30 

10 Cys Cys Thr Asp Tyr Tyr Ala Ser Tyr Gly Ser Ala Trp Asn Lys Leu 
35 40 45 

Glu He Val Gin Tyr Met Thr Thr Asp Leu Phe Thr He Ala Met Leu 
50 55 60 

Asp . Leu Ala Met Phe Leu Cys Thr Phe Phe Val Val Phe Val His Trp 
15 65 70 , 75 80 

Leu Val Lys Lys Arg He He Asn Trp Lys Trp Thr Gly Phe Val Ala 
85 .90 95 

Val Ser He Phe Glu Leu Ala Phe He Pro Val Thr Phe Pro He Tyr 
100 105 110 

20 Val Tyr Tyr Phe Asp Phe Asn Trp Val Thr Arg He Phe Leu Phe Leu 
115 120 125 

His Ser Val Val Phe Val Met Lys Ser His Ser Phe Ala Phe Tyr Asn 
130 135 140 

Gly Tyr Leu Trp Asp He Lys Gin Glu Leu Glu Tyr Ser Ser Lys Gin 
25 145 150 155 160 

Leu Gin Lys Tyr Lys Glu Ser Leu Ser Pro Glu Thr Arg Glu He Leu 
165 170 175 

Gin Lys Ser Cys Asp Phe Cys Leu Phe Glu Leu Asn Tyr Gin Thr Lys. 
180 185 190 

30 Asp Asn Asp Phe. Pro Asn Asn He Ser Cys Ser Asn Phe Phe Met Phe 
195 200 205 

Cys Leu Phe Pro Val Leu Val Tyr Gin He Asn Tyr Pro Arg Thr Ser 
210 215 220 

Arg He Arg Trp Arg Tyr Val Leu Glu Lys Val Cys Ala He He Gly 
35 225 230 235 240 

Thr He Phe Leu Met Met Val Thr Ala Gin Phe Phe Met His Pro Val 
245 250 255 

Ala Met Arg Cys He Gin Phe His Asn Thr Pro Thr Phe Gly Gly Trp 
260 265 270 
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lie Pro Ala Thr Gin Glu Trp Phe His Leu Leu Phe Asp Met lie Pro 
275 280 285 

Gly Phe Thr Val Leu Tyr Met Leu Thr Phe Tyr Met lie Trp Asp Ala 
290 295 300 

5 Leu Leu ABn Cys Val Ala Glu Leu Thr Arg Phe Ala Asp Arg Tyr Phe 
305 310 315 320 

Tyr Gly Asp Trp Trp Asn Cys Val Ser Phe Glu Glu Phe Ser Arg He 
325 330 335 

Trp Asn Val Pro Val His Lys Phe Leu Leu Arg His Val Tyr His Ser 
10 340 345 35a 

Ser Met Gly Ala Leu His Leu Ser Lys Ser Gin Ala Thr Leu Phe Thr 
355 360 365 

Phe Phe Leu Ser Ala Val Phe His Glu Met Ala Met Phe Ala He Phe 
370 375 380 

15 Arg Arg Val Arg Gly Tyr Leu Phe Met Phe Gin Leu Ser Gin Phe Val 
385 390 395 400 

Trp Thr Ala Leu Ser Asn Thr Lys Phe Leu Arg Ala Arg Pro Gin Leu 
405 410 415 

Ser Asn val Val Phe Ser Phe Gly Val Cys Ser Gly Pro 
20 420 425 



<210> 37 
<211> 432 
<212> PRT 

<213> Saccharomyces cerevisiae 
25 <400> 37 

Glu Thr Val Val Thr Val Glu Thr Thr He He Ser Ser Asn Phe Ser 
15. 10 15 

Gly Leu Tyr Val Ala Phe Trp Met Ala He Ala Phe Gly Ala Val Lys 
20 25 30 

30 Ala Leu He Asp Tyr Tyr Tyr Gin His Aan Gly Ser Phe Lys Asp Ser 
35 40 45 

Glu He Leu Lys Phe Met Thr Thr Asn Leu Phe Thr Val Ala Ser Val 
50 55 60 

Asp Leu Leu Met Tyr Leu Ser Thr Tyr Phe Val Val Gly He Gin Tyr 
35 65 70 75 80 

Leu Cys Lys Trp Gly Val Leu Lys Trp Gly Thr Thr Gly Trp He Phe 
85 90 95 
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Thr Ser He Tyr Glu Phe Leu Phe Val He Phe Tyr Met Tyr Leu Thr 
100 105 110 

Glu Asn He Leu Lys Leu His Trp Leu Ser Lys He Phe Leu Phe Leu 
115 120 125 

5 His Ser Leu Val Leu Leu Met Lys Met His Ser Phe Ala Phe Tyr Asn 
130 135 140 

Gly Tyr Leu Trp Gly He Lys Glu Glu Leu Gin Phe Ser Lys Ser Ala 
145 150 155 160 

Leu Ala Lys Tyr Lys Asp Ser He Asn Asp Pro Lys Val He Gly Ala 
10 165 170 175 

Leu Glu Lys Ser Cys Glu Phe Cys Ser Phe Glu Leu Ser Ser Gin Ser 
180 185 190 

Leu Ser Asp Gin Thr Gin Lys Phe Pro Asn Asn He Ser Ala Lys Ser 
195 200 205 

15 Phe Phe Trp Phe Thr Met Phe Pro Thr Leu He Tyr Gin He Glu Tyr 
210 215 220 

• Pro Arg Thr Lys Glu He Arg Trp Ser Tyr Val Leu Glu Lys He Cys 
225 230 235 240 

Ala He Phe Gly Thr He Phe Leu Met Met He Asp Ala Gin He Leu 
20 245 250 255 

Met Tyr Pro Val Ala Met Arg Ala Leu Ala Val Arg Asn Ser Glu Trp 
260 265 270 

Thr Gly He Leu Asp Arg Leu Leu Lys Trp Val Gly Leu Leu Val Asp 
275 280 285 

25 He Val Pro Gly Phe He Val Met Tyr He Leu Asp Phe Tyr Leu He 
290 295 300 

Trp Asp Ala He Leu Asn Cys Val Ala Glu Leu Thr Arg Phe Gly Asp 
305 310 315 320 

Arg Tyr Phe Tyr Gly Asp Trp Trp Asn Cys Val Ser Trp Ala Asp Phe 
30 325 330 335 

Ser Arg He Trp Asn He Pro Val His Lys Phe Leu Leu Arg His Val 
340 . 345 350 

Tyr His Ser Ser Met Ser Ser Phe Lys Leu Asn Lys Ser Gin Ala Thr 
355 3€0 365 

35 Leu Met Thr Phe Phe Leu Ser Ser Val Val His Glu Leu Ala Met Tyr 
370 375 380 

Val He Phe Lys Lys Leu Arg Phe Tyr Leu Phe Phe Phe Gin Met Leu 
385 390 395 400 



wo 01/16308 PCT/USOO/23863 

30 



Qln Met Pro Leu Val Ala Leu Thr Aen Thr Lys Phe Met Arg Asn Arg 
405 410 415 

Thr He He Oly Asn Val He Phe Trp Leu Gly He Cys Met Gly Pro 
420 425 430 

5 



<210> 38 
<211> 1942 
<212> 0NA 

<213> Arabidopsls thaliana 
10 <400> 38 

ctctcgtgaa tcctttttcc tttcttcttc ttcttctctt cagagaaaac tttgcttctc 60 
tttctataag g'aaccagaca cgaatcccat tcccaccgat ttcttagctt cttccttcaa 120 
tccgctcttt ccctctccat tagattctgt ttcctctttc aatttcttct gcatgcttct 180 
cgattctctc tgacgcctct tttctcccga cgctgtttcg tcaaacgctt ttcgaaatgg 240 

15 cgattttgga ttctgctggc gttactacgg tgacggagaa cggtggcgga gagttcgtcg 300 
atcttgatag gcttcgtcga cggaaatcga gatcggattc ttctaacgga cttcttctct 360 
ctggttccga. taataattct ccttcggatg atgttggagc tcccgccgac gttagggatc 420 
ggattgattc cgttgttaac gatgacgctc agggaacagc caatttggcc ggagataata 480 
acggtggtgg cgataataac ggtggtggaa gaggcggcgg agaaggaaga ggaaacgccg 540 

20 atgctacgtt tacgtatcga ccgtcggttc cagctcatcg gagggcgaga gagagtccac 600 
ttagctccga cgcaatcttc aaacagagcc atgccggatt attcaacctc tgtgtagtag 660 
ttcttattgc tgtaaacagt agactcatca tcgaaaatct tatgaagtat ggttggttga 720 
tcagaacgga tttctggttt agttcaagat cgctgcgaga ttggccgctt ttcatgtgtt 780 
gtatatccct ttcgatcttt cctttggctg cctttacggt tgagaaattg gtacttcaga 840 

25 aatacatatc agaacctgtt gtcatctttc ttcatattat tatcaccatg acagaggttt 900 
tgtatccagt ttacgtcacc ctaaggtgtg attctgcttt tttatcaggt gtcactttga 960 
tgctcctcac ttgcattgtg tggctaaagt tggtttctta tgctcatact agctatgaca 1020 
taagatccct agccaatgca gctgataagg ccaatcctga agtctcctac tacgttagct 1080 
tgaagagctt ggcatatttc atggtcgctc ccacattgtg ttatcagcca agttatccac 1140 

30 " gttctgcatg tatacggaag ggttgggtgg ctcgtcaatt tgcaaaactg gtcatattca 1200 
ccggattcat gggatttata atagaacaat atataaatcc tattgtcagg aactcaaagc 1260 
atcctttgaa aggcgatctt ctatatgcta ttgaaagagt gttgaagctt tcagttccaa 1320 
atttatatgt gtggctctgc atgttctact gcttcttcca cctttggtta aacatattgg 1380 
cagagcttct ctgcttcggg gatcgtgaat tctacaaaga ttggtggaat gcaaaaagtg 1440 

35 tgggagatta ctggagaatg tggaatatgc ctgttcataa atggatggtt cgacatatat 1500 
acttcccgtg cttgcgcagc aagataccaa agacactcgc cattatcatt gctttcctag 1560 
tctctgcagt ctttcatgag ctatgcatcg cagttccttg tcgtctcttc aagctatggg 1620 
cttttcttgg gattatgttt caggtgcctt tggtcttcat cacaaactat ctacaggaaa 1680 
ggtttggctc aacggtgggg aacatgatct tctggttcat cttctgcatt ttcggacaac 1740 

40 cgatgtgtgt gcttctttat taccacgacc tgatgaaccg aaaaggatcg atgtcatgaa 1800 
acaactgttc aaaaaatgac tttcttcaaa catctatggc ctcgttggat ctccgttgat 1860 
gttgtggtgg ttctgatgct aaaacgacaa atagtgttat aaccattgaa gaagaaaaga 1920 
caattagagt tgttgtatcg ca 1942 

<210> 39 
45 <211> 520 
<212> PRT 

<213> Arabidopsls thaliana 
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<400> 39 

Met Ala lie Leu Asp Ser Ala Gly Val Thr Thr Val Thr Glu Asn Gly 
1 5 10 15 

Gly Gly Glu Phe Val Asp Leu Asp Arg Leu Arg Arg Arg Lys Ser Arg 
5 20. 25 30 

Ser Asp Ser Ser Asn Gly Leu Leu Leu Ser Gly Ser Asp Asn Asn Ser 
35 40 45 

Pro Ser Asp Asp Val Gly Ala Pro Ala Asp Val Arg Asp Arg He Asp 
50 • 55 60 

10 Ser Val Val Asn Asp Asp Ala Gin Gly Thr Ala Asn Leu Ala Gly Asp 
€5 70 75 80 

Asn Asn Gly Gly Gly Asp Asn Asn Gly Gly Gly Arg Gly Gly Gly Glu 
85 90 95 

Gly Arg Gly Asn Ala Asp Ala Thr Phe Thr Tyr Arg Pro Ser Val Pro 
15 100 105 110 

Ala His Arg Arg Ala Arg Glu Ser Pro Leu Ser Ser Asp Ala lie Phe 

X15 120 125 

Lys Gin Ser His Ala Gly Leu Phe Asn Leu Cys Val Val Val Leu He 
130 135 140 

20 Ala Val Asn Ser Arg Leu He He Glu Asn Leu Met Lys Tyr Gly Trp 
145 150 155 160 

Leu He Arg Thr Asp Phe Trp Phe Ser Ser Arg Ser Leu Arg Asp Trp 
165 170 175 

Pro Leu Phe Met Cys Cys He Ser Leu Ser He Phe Pro Leu Ala Ala 
25 180 185 190 

Phe Thr Val Glu Lys Leu Val Leu Gin Lys Tyr He Ser Glu Pro Val 
195 200 205 

Val He Phe Leu His He He He Thr Met Thr Glu Val Leu Tyr Pro 
210 215 220 

30 Val Tyr Val Thr Leu Arg Cys Asp Ser Ala Phe Leu Ser Gly Val Thr 
225 230 235 240 

Leu Met Leu Leu Thr Cys He Val Trp Leu Lys Leu Val Ser Tyr Ala 

245 - 250 255 

His Thr Ser Tyr Asp He Arg Ser Leu Ala Asn Ala Ala Asp Lys Ala 
35 260 265 270 

Asn Pro Glu Val Ser Tyr Tyr Val Ser Leu Lys Ser Leu Ala Tyr Phe 
275 280 285 
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Met Val Ala Pro Thr Leu Cys Tyr Qln Pro Ser Tyr Pro Arg Ser Ala 
290 295 300 

cys lie Arg hye Gly Trp Val Ala Arg Gin Phe Ala Lys Leu Val He 
305 310 315 320 

• 5 Phe Thr Gly Phe Met Gly Phe He He Glu Gin Tyr He Asn Pro He 

325 330 335 

Val Arg Asn Ser Lys His Pro Leu Lys Gly Asp Leu Leu Tyr Ala He 
340 345 350 

Glu Arg Val Leu Lys Leu Ser Val Pro Asn Leu Tyr Val Trp Leu Cys 
10 355 360 365 

Met Phe Tyr Cys Phe Phe His Leu Trp Leu Asn He Leu Ala Glu Leu . 
370 375 380 

Leu Cys Phe Gly Asp Arg Glu Phe Tyr Lys Asp Trp Trp Asn Ala Lys 
385 390 395 400 

15 Ser val Gly Asp Tyr Trp Arg Met Trp Asn Met Pro Val His Lys Trp 

405 410 415 

Met Val Arg His He Tyr Phe Pro Cys Leu Arg Ser Lys He Pro Lys 
420 425 430 

Thr Leu Ala He He He Ala Phe Leu Val Ser Ala Val Phe His Glu 
20 435 440 445 

Leu Cys He Ala Val Pro Cys Arg Leu Phe Lys Leu Trp Ala Phe Leu 
450 455 460 

Qly He Met Phe Gin Val Pro Leu Val Phe He Thr Asn Tyr Leu Gin 
465 470 475 480 

25 Glu Arg Phe Gly Ser Thr Val Gly Asn Met He Phe Trp Phe He Phe 

485 490 495 

Cys He Phe Gly Qln Pro Met Cys Val Leu Leu Tyr Tyr His Asp Leu 
500 505 510 

Met Asn Arg Lys Gly Ser Met Ser 
30 515 520 



<210> 40 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide primer 



<400> 40 

S tgcaaattga cgagcacacc aaccccttc 



<210> 41 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
10 <220> 

<223> Description of Artificial Sequence: Synthetic 
oligonuclotide primer 

<400> 41 

aaggatgctt tgagttcctg acaatagg 



15 <210> 42 

<211> 1942 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 42 

20 ctctcgtgaa tcctttttcc tttcttcttc ttcttctctt cagagaaaac tttgcttctc 60 
tttctataag gaaccagaca cgaatcccat tcccaccgat ttcttagctt cttccttcaa 120 
tccgctcttt ccctctccat tagattctgt ttcctctttc aatttcttct gcatgcttct 180 
cgattctctc tgacgcctct tttctcccga cgctgtttcg tcaaacgctt ttcgaaatgg 240 
cgattttgga ttctgctggc gttactacgg tgacggagaa cggtggcgga gagttcgtcg 300 

25 atcttgatag gcttcgtcga cggaaatcga gatcggattc ttctaacgga cttcttctct 360 
ctggttccga taataattct ccttcggatg -atgttggagc tcccgccgac gttagggatc 420 
ggattgattc cgttgttaac gatgacgctc agggaacagc caatttggcc ggagataata 480 
acggtggtgg cgataataac ggtggtggaa gaggcggcgg agaaggaaga ggaaacgccg 540 
atgctacgtt tacgtatcga ccgtcggttc cagctcatcg gagggcgaga gagagtccac 600 

30 ttagctccga cgcaatcttc aaacagagcc atgccggatt attcaacctc tgtgtagtag 660 
ttcttattgc tgtaaacagt ag^ctcatca tcgaaaatct tatgaagtat ggttggttga 720 
tcagaacgga tttctggttt agttcaagat cgctgcgaga ttggccgctt ttcatgtgtt 780 
gtatatccct ttcgatcttt cctttggctg cctttacggt tgagaaattg gtacttcaga 840 
aatacatatc agaacctgtt gtcatctttc ttcatattat tatcaccatg acagaggttt 900 

3S tgtatccagt ttacgtcacc ctaaggtgtg attctgcttt tttatcaggt gtcactttga 960 
tgctcctcac ttgcattgtg tggctaaagt tggtttctta tgctcatact agctatgaca 1020 
taagatccct agccaatgca gctgataagg ccaatcctga agtctcctac tacgttagct 1080 
tgaagagctt ggcatatttc atggtcgctc ccacattgtg ttatcagcca agttatccac 1140 
gttctgcatg tatacggaag ggttgggtgg ctcgtcaatt tgcaaaactg gtcatattca 1200 

40 ccggattcat gggatttata atagaacaat atataaatcc tattgtcagg aactcaaagc 1260 
atcctttgaa aggcgatctt ctatatgcta ttgaaagagt gttgaagctt tcagttccaa 1320 
atttatatgt gtggctctgc atgttctact gcttcttcca cctttggtta aacatattgg 1380 
cagagcttct ctgcttcggg gatcgtgaat tctacaaaga ttggtggaat gcaaaaagtg 1440 
tgggagatta ctggagaatg tggaatatgc ctgttcataa atggatggtt cgacatatat 1500 

45 acttcccgtg cttgcgcagc aagataccaa agacactcgc cattatcatt gctttcctag 1560 
tctctgcagt ctttcatgag ctatgcatcg cagttccttg tcgtctcttc aagctatggg 1620 
cttttcttgg gattatgttt caggtgcctt tggtcttcat cacaaactat ctacaggaaa 1680 
ggtttggctc aacggtgggg aacatgatct tctggttcat cttctgcatt. ttcggacaac 1740 
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cgatgtgtgt gcttctttat taccacgacc 
acaactgttc aaaaaatgac tttcttcaaa 
gttgtggtgg ttctgatgct aaaacgacaa 
caattagagt tgttgtatcg ca 



tgatgaaccg aaaaggatcg atgtcatgaa 1800 
catctatggc ctcgttggat ctccgttgat X860 
atagtgttat aaccattgaa gaagaaaaga 1920 

1942 



5 <210> 43 
<211> 234 
<212> DI!IA 
<213> Glycine max 

<220> 
10 <221> unsure 

<222>- (1) . . (234) 
<223> h=\inknown 

<400> 43 

gtaagcttca agagcttagc atanttcctg 
15 ctatcctcgc acaccttata ttcgaaaggg 
atatttacag gagttatggg atttataata 
tcacagcatc ctctcaaggg aaaccttctt 



gttgccccta ncattatgtt accagccaan 60 
ttggctgttt cgccaacttg tcaactgata 120 
gaacaataca ttaatcccat tgtacaaaat 180 
tacgccatcg.agagagttct gaag 234 



<210> 44 
<211> 267 
20 <212> DNA 

<213> Glycine max 

<400> 44 

ctgcttttgt atctggtgtc acgttgatg6 
tgtcatatgc acatacaaac tatgatatga 
25 aaacattacc caatactttg atatggagta 
atacttcatg gttgctccta cattatgcta 
tcgaaagggt tgggtgtttc gtcaact 



tattaacttg cattgtgtgg ttaaaattgg 60 
gagcacttac tgtttcgaat gaaaagggag 120 
tccgtacact gtgaccttca ggagtttggc 180 
tcagacaagc tatcctcgca caccttcagt 240 

267 



<210> 45 
<211> 275 
30 <212> DNA 

<:213> Glycine max 

<220> 

<22l> unsure 
<222> (1) . . (275> 
35 <223> n=unknown 

<400> 45 

gtggaatgcc aaaactgttg aagattattg 
gatgatccgc cacctatatt ttccatgttt 
tttaattgcc ttcctggttc tgctttattc 
40 catattcaag tngtgggttt cngnggaatt 
attnntnggc naaaaaattc cnngaacccc 



gaggatgtgg aatatgcctg ttcacaaatg 60 
aaggcacggt ataccaaagg ccgttgctct 120 
catgagctgt gcatcgctgt tccttgccca 180 
nagtttcagg tnccttgggt ttcnaccnna 240 
ggggg 275 
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<210> 46 

<211> 257 
<212> DNA 

<213> Glycine max 



5 <400> 46 

aacggaattg agactccaga gaatatgcca 
ggcttttgga aaaactggca tgcttccttc 
cctcttgggg gatctaagaa aaagctacta 
atctggcatg atttagagtg gaagcttctt 
10 atccctgagt tggtttt 



aaatgtatta ataattgtca caacttggaa 60 
aacaagtggc ttgtgaggta tatatacatt 120 
aatgtgtggg ttgttttcac atttgttgca 18 o 
tcatgggcat ggttgacgtg tttattcttc 240 

257 



<210> 47 
<211> 253 
<212> DNA 
<213> Zea mays 

15 <400> 47 

agaaaatgga acatgcctgt gcataaatgg 
cgaaatggta tatcaaagga agttgctgtt 
atgagttatg tgttgctgtt ccctgccaca 
atgcttcaga ttcccctcat catattgaca 

20 atggttggca ata 



attgttcgtc atatatattt tccttgcatg 60 
tttatatcgt tcttgtttct gctgtacttc 120 
tactcaagtt ctgggctttt tttaggaatc 180 
tcatacctca aaaataaatt cagtgacaca 2.40 

253 



<210> 48 
<211> 254 
<212> DNA 
<213> Zea mays 



25 <400> 48 

tgaagtatgg cttattaata agatctggct 
ggccactgct aatgtgttgc cttagtctac 
aaaagttggc attcaacaat ctcattagtg 
ttacaacatt tgaaattgta tatccagtgc 

30 tatcaggctt tgtg 



tttggtttaa tgctacatca ttgcgagact 60 
ccatatttcc ccttggtgca tttgcagtcg 120 
atcctgctac tacctgtttt cacatccttt 180 
tcgtgattct taagtgtgat tctgcagttt 240 

254 



<210> 49 
<211> 262 
<212> DNA 
<213> Zea mays 

35 <400> 49 

gaagtatggc ttattaataa gatctggctt 
gccactgcta atgtgttgcc ttagtctacc 
aaagttggca ttcaacaatc tcattagtga 
tacaacattt gaaattgtat atccagtgct 

40 acaggctttg tgttgatgtt ta 



ttggtttaat gctacatcat tgcgagactg 60 
catatttccc cttggtgcat ttgcagtcga 120 
tcctgctact acctgttttc acatcctttt 180 
cgtgattctt aagtgtgatt ctgcagtttt 240 

262 



<210> 50 
<211> 325 
<212> DNA 
. <213> Zea mays 
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<220> 

<221> unsure 
<222> (1) . . (32S) 
<223> nsBunknovm 

5 <400> 50 

taatcnaacc tcgntncngg ttcagctgta 
cacatantca natctnggca tnncngggat 
aagatatctc catgctacgt tcaagcatgt 
cagtatagtc ggacagccga tgtnimnnna 
10 ggcccaggca agtagatagt ncggcagaga 
nacngagcga gcggcangaa ncagc 



tnccatgaga tatgtaatgc ggtgccgtgc 60 
catngttcag ataccgntgg. nattcttgac 120 
aatggtgggc aacatgatan tttggntctn 180 
tctatactac catgacgtca tgaacaggca 24 0 
catgtacttc aacatcganc atcagnagca 300 

325 



<210> 51 

<211> 519 \ 
<212> DNA 
15 <213> Mortierrella alpina 

<220> 

<221> unsure 
<222> (1) . . (519) 
<223> hsunknown 

20 <400> 51 

gagnnnngna acgtttagcc tnccgtagcc gccaaaatcc aagggncnac cnaccctncg 60 
ttanactnaa ttngaaaatn cnnncccaac ttnaggnact tnnagncccc ccnacttgac 120 
aacggagcac tatatttacc ccgtggtngt tcaacccagc catctcaccc ttgcgagcat 180 
tggtgctgct cttgataccc ttcatgctta actatctcat gatcttttac atcattttcg 240 
25 agtgcatctg caacgccttt gcggaactaa gttgctttgc ggatcgcaac ttttacgagg 300 
attggtggaa ctgcgtcagc tttgatgagt gggcacgcaa atggaacaag cctgtgcaac 360 
acttcttgct ccgccacgtg tacgactcga gcatccgagt ccttccactt gtccgaaatc 420 
caatgccgcn aattgcaaac gttccttccc ggtcgtcaat gcgttcaacg aacctgggtg 480 
aagaatgggt ggtgacaacg ttaaagtgcg cccggtatc 519 

J 

30 <210> 52 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

35 <223> Description of Artificial Sequence: 
Oligonucleotide primer 

<400> 52 

ggatccgcgg ccgcacaatg aaaaaaatat cttcacatta ttcgg 45 



<210> S3 
40 <211> 40 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide primer 

<400> 53 

5 ggatcccctg caggtcattc attgacggca ttaacattgg 



<210> 54 
<211> 44 
<212> DNA 

<213> Artificial Sequence 
10 <220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide primer 

<400> 54 

ggatccgcgg ccgcacaatg ggagcgaatt cgaaatcagt aacg 



15 <210> 55 
<211> 40 
<2i2> DNA 

<213> Artificial Sequence 
<220> 

20 <223> Description of Artificial Sequence: Synthetic 
oligonucleotide primer 

<400> 55 

ggatcccctg caggttaata cccactttta tcaagctccc 



<210> 56 
25 <211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
30 oligonucleotide primer 

<40O> 56 ' 

ggatccgcgg ccgcacaatg tctctattac tggaagagat c 



<210> 57 . 
<211> 41 
35 <212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide primer 
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<400> 57 

ggatcccctg caggttatgc atcaacagag acacttacag c 41 



<210> 58 
<211> 41 
5 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide primer 

10 <400> 58 . 

ggatccgcgg ccgcacaatg. ggctggattc cgtgtccgtg c 41 



<210> 59 
<211> 38 
<212> DNA 
15 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide primer 

<400> 55 

20 ggatcccctg caggttaacc agaatcaact actttgtg 38 



<210> 60 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
25 <220> 

<223> Description of Artificial Sequence: 
Oligonucleotide primer 

<400> 60 

tcgacctgca ggaagcttag aaatggcgat tttggattc 39 



30 <210> 61 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

35 <223> Description of Artificial Sequence: 
Oligonucleotide primer 



<400> 61 

ggatccgcgg ccgct'catga catcgatcct tttcgg 



36 
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<210> 62 
<211> 56 
<212> DNA 

<213> Artificial Sequence 
5 <220> 

<223> Description of Artificial Sequence: Annealed 
oligonucleotide adapter 

<400> 62 

.cgcgatttaa atggcgcgcc ctgcaggcgg ccgcctgcag ggcgcgccat ttaaat 



10 <210> 63 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

15 <223> Description of Artificial Sequence: Ligating 
ol igonucleot ide 

<400> 63 

. tcgaggatcc gcggccgcaa gcttcctgca gg 



<210> 64 
20 <211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ligating 
25 oligonucleotide 

<400> 64 

tcgacctgca ggaagcttgc ggccgcggat cc 



<210> 65 
<211> 32 
30 , <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Ligating 
oligonucleotide 

35 <400> 65 

tcgacctgca ggaagcttgc ggccgcggat cc 
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<210> 66 
<211> 32 
<:212> DNA 
<213>- Artificial 



Sequence 



5 <220> 

<223> Description of Artificial Sequence: Ligating 
oligonucleotide 



<400> 66 

tcgaggatcc gcggccgcaa gcttcctgca gg 32 



10 <210> 67 
<211> 36 
<212> DNA 

<213> Artificial Sequence 



<220> 

15 <223> Description of Artificial Sequence: Ligating 
oligonucleotide 

<400> 67 

tcgaggatcc gcggccgcaa gcttcctgca ggagct 



<210> 68 
20 <211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ligating 
25 oligonucleotide 

<400> 68 

cctgcaggaa gcttgcggcc gcggatcc 



<210> 69 
<211> 36 
30 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequences Ligating 
oligonucleotide 



3S <400> 69 

tcgacctgca ggaagcttgc ggccgcggat ccagct 36 
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<210> 70 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
S <220> 

<223> Description of Artificial Sequence: Iiigating 
oligonucleotide 

<400> 70 

ggatccgcgg ccgcaagctt cctgcagg 28 



10 <210> 71 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

15 <223> Description of Artificial Sequence: Iiigating 
oligonucleotide 

<400> 71 

gatcacctgc aggaagcttg cggccgcgga tccaatgca 39 

<210> 72 
20 <211> 31 
<212> DNA 

<213> Artificial Sequence 



<210> 73 
<211> 2013 
30 <212> DNA 

<213> Arabidopsis thaliana 

<400> 73 

atgcccctta ttcatcggaa aaagccgacg gagaaaccat cgacgccgcc atctgaagag 60 

gtggtgcacg atgaggattc gcaaaagaaa ccacacgaat cttccaaatc ccaccataag 120 

35 aaatcgaacg gaggagggaa gtggtcgtgc atcgattctt gttgttggtt cattgggtgt 180 

gtgtgtgtaa cctggtggtt tcttctcttc ctttacaacg caatgcctgc gagcttccct 240 

cagtatgtaa cggagcgaat cacgggtcct ttgcctgacc cgcccggtgt taagctcaaa 300 

•aaagaaggtc ttaaggcgaa acatcctgtt gtcttcattc ctgggattgt caccggtggg 360 

ctcgagcttt gggaaggcaa acaatgcgct gatggtttat ttagaaaacg tttgtggggt 420 

40 ggaacttttg gtgaagtcta caaaaggcct ctatgttggg tggaacacat gtcacttgac 480 

aatgaaactg ggttggatcc agctggtatt agagttcgag ctgtatcagg actcgtggct 540 

gctgactact ttgctcctgg ctactttgtc tgggcagtgc tgattgctaa ccttgcacat '600 

attggatatg aagagaaaaa tatgtacatg gctgcatatg actggcggct ttcgtttcag 660 



25 



<220> 

<223> Description of Artificial Sequence: Ligating 
oligonucleotide 



<40O> 72 

ttggatccgc ggccgcaagc ttcctgcagg 



t 



31 
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aacacagagg tacgtgatca gactcttagc 
tctaccaacg gtggaaaaaa agcagttata 
ctacatttta tgaagtgggt tgaggcacca 
tggtgtgcaa agtatattaa ggcggtgatg 
5 aaagctgttg cagggctttt ctctgctgaa 
gccccaggat tcttagacac cgatatattt 
atgacacgca catgggactc aacaatgtct 
ggcgggcttg attggtcacc ggagaaaggc 
aacgaaactt gtggtgaagc aggtgaaaac 

10 ggaaggatga tatcttttgg gaaagaagta 
attgattttc gaggtgctgt caaaggtcag 
tggacagagt accatgacat gggaattgct 
tacactgctg gtgaagctat agatctacta 
ggtgccgctc atttctctta tggaattgct 

15 cccaaatact ggtcaaatcc gttagagaca 
tactcattat acggagtggg gataccaacg 
tctcccgaca gttgcatccc ctttcagata 
agctgtctga aagcaggagt ttacaatgtg 
gccgggtaca tgtgtgcaaa agcgtggcgt 

20 aagacttata taagagaata caatcactct 
acgcagagtg gtgcccatgt tgatatcatg 
agggttgccg ccggaggtaa cgggtctgat 
tttgaatggt cggagcgtat tgacctgaag 



cgtatgaaaa gtaatataga gttgatggtt 720 
gttccgcatt ccatgggggt cttgtatttt 780 
gctcctctgg gtggcggggg tgggccagat 840 
aacattggtg gaccatttct tggtgttcca 900 
gcaaaggatg ttgcagttgc cagagcgatt 960 
agacttcaga ccttgcagca tgtaatgaga 1020 
atgttaccga agggaggtga cacgatatgg 1080 
cacacctgtt gtgggaaaaa gcaaaagaac 1140 
ggagtttcca agaaaagtcc tgttaactat 1200 
gcagaggctg cgccatctga gattaataat 1260 
agtatcccaa atcacacctg tcgtgacgtg 1320 
gggatcaaag ctatcgctga gtataaggtc 1380 
cattatgttg ctcctaagat gatggcgcgt 1440 
gatgatttgg atgacaccaa gtatcaagat 15O0 
aaattaccga atgctcctga gatggaaatc 1560 
gaacgagcat acgtatacaa gcttaaccag 1620 
ttcacttctg ctcacgagga ggacgaagat 1680 
gatggggatg aaacagtacc cgtcctaagt 1740 
ggcaagacaa gattcaaccc ttccggaatc 1800 
ccgccggcta acctgttgga agggcgcggg 1860 
ggaaactttg ctttgatcga agatatcatg 1920 
ataggacatg accaggtcca ctctggcata 1980 
ctg 2013 



<210> 74 
25 <211> 671 

<212> PRT 

<213> Arabidopsis thaliana 



<400> 74 

Met Pro Leu He His Arg Lys Lys Pro Thr Glu Lys Pro Ser Thr Pro 
30 1 5 10 15 

Pro Ser Glu Glu Val Val His Asp Glu Asp Ser Gin Lys Lys Pro His 
20 25 30 

. Glu Ser Ser Lys Ser His His Lys Lys Ser Asn Qly Gly Gly Lys Trp 
35 40 45 

35 Ser Cys He Asp Ser Cys Cys Trp Phe He Qly Cys Val Cys Val Thr 
50 55 60 

Trp Trp Phe Leu Leu Phe Leu Tyr Asn Ala Met Pro Ala Ser Phe Pro 
65 70 75 ' 80 

Gin Tyr Val Thr Glu Arg He Thr Gly Pro Leu Pro Asp Pro Pro Gly 
40 85 90 95 

Val Lys Leu Lys Lys Glu Gly Leu Lys Ala Lys His Pro Val Val Phe 
100 105 110 

He Pro Gly He Val Thr Gly Gly Leu Glu Leu Trp Glu Gly Lys Gin . 
115 120 125 
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Cys Ala Asp Gly Leu Phe Arg Lys Arg Leu Trp Gly Gly Thr Phe Gly 
130 1^5 140 

Glu Val Tyr Lys Arg Pro Leu Cys Trp Val Glu His Met Ser Leu Asp 
145 ISO 155 160 

5 Asn Glu Thr Gly Leu Asp Pro Ala Gly He Arg Val Arg Ala Val Ser 

165 170 175 

Gly Leu Val Ala Ala Asp Tyr Phe Ala Pro Gly Tyr Phe Val Trp Ala 
180 185 190 



10 



Val Leu He Ala Asn Leu Ala His He Gly Tyr Glu Glu Lys Asn Met 
195 200 205 

Tyr Met Ala Ala Tyr Asp Trp Arg Leu Ser Phe Gin Asn Thr Glu Val 
210 215 220 

Arg Asp Gin Thr Leu Ser Arg Met Lys Ser Asn He Glu Leu Met Val 
225 230 235 240 

15 ser Thr Asn Gly Gly Lys Lys Ala Val He Val Pro His Ser Met Gly 

245 250 255 

val Leu Tyr Phe Leu His Phe Met Lys Trp Val Glu Ala Pro Ala Pro 
260 265 270 

Leu Gly Gly Gly Gly Gly Pro Asp Trp C?ys Ala Lys Tyr He Lys Ala 
20 275 280 285 

Val Met Asn He Gly Gly Pro Phe Leu Gly Val Pro Lys Ala Val Ala 
290 295 300 

Gly Leu Phe Ser Ala Glu Ala Lys Asp Val Ala Val Ala Arg Ala He 
305 310 315 320 

25 Ala Pro Gly Phe Leu Asp Thr Asp He Phe Arg Leu Gin Thr Leu Gin 

325 330 335 

His Val Met Arg Met Thr Arg Thr Trp Asp Ser Thr Met Ser Met Leu 
340 345 350 

Pro Lys Gly Gly Asp Thr He Trp Gly Gly Leu Asp Trp Ser Pro Glu 
30 355 360 365 

Lys Gly His Thr Cys Cys Gly Lys Lys Gin Lys Asn Asn Glu Thr Cys 



370 



375 



380 



Gly Glu Ala Gly Glu Asn Gly Val Ser Lys Lys Ser Pro Val Asn Tyr 
385 390 395 400 

35 Gly Arg Met He Ser Phe Gly Lys Glu Val Ala Glu Ala Ala Pro Ser 

405 410 415 

Glu He Asn Asn He Asp Phe Arg Gly Ala Val Lys Gly Gin Ser He 
420 425 43.0 
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Pro Asn His Thr Cys Arg Asp Val Trp Thr Glu Tyr His Asp Met Gly 
435 440 445 

lie Ala Gly lie Lys Ala lie Ala Glu Tyr Lys Val Tyr Thr Ala Gly 
450 455 460 

5 Glu Ala He Asp Leu Leu His Tyr Val Ala Pro Lys Met Met Ala Arg 
465 470 475 480 

Gly Ala Ala His Phe Ser Tyr Gly Xle Ala Asp Asp Leu Asp Asp Thr 
485 490 495 

Lys Tyr Gin Asp Pro Lys Tyr Trp Ser Asn Pro Leu Glu Thr Lys Leu 
10 500 505 510 

Pro Asn Ala Pro Glu Met Glu He Tyr Ser Leu Tyr Gly Val Gly He 
515 520 525 

Pro Thr Glu Arg Ala Tyr Val Tyr Lys Leu Asn Gin Ser Pro Asp Ser 
530 535 540 

15 Cys lie Pro Phe Gin He Phe Thr Ser Ala His Glu Glu Asp Glu Asp 
545 550 555 560 

Ser Cys Leu Lys Ala Gly Val Tyr Asn Val Asp Gly Asp Glu Thr Val 
565 570 575 

Pro Val Leu Ser Ala Gly Tyr Met Cys Ala Lys Ala Trp Arg Gly Lys 
20 580 585 590 

Thr Arg Phe Asn Pro Ser Gly He Lys Thr Tyr He Arg Glu Tyr Asn 
595 600 605 

His Ser Pro Pro Ala Asn Leu Leu Glu Gly Arg Gly Thr Gin Ser Gly 
610 615 620 

25 Ala His Val Asp He Met Gly Asn Phe Ala Leu He Glu Asp He Met 
625 630 635 640 

Arg Val Ala Ala Gly Gly Asn Gly Ser Asp He Gly His Asp Gin Val 
645 650 655 



His Ser Gly He Phe Glu Trp Ser Glu Arg He Asp Leu Lys Leu 
30 660 665 670 



<210> 75 
<211> 1986 
<212> DNA 

<213> Saccharomyces cerevisiae 

35 <400> 75 

atgggcacac tgtttcgaag aaatgtccag 
aaagggggtt ctgttcataa caagcgagag 
ttaggccata agagaagaag gggtattagt 
gatttcgaca ggaaaagaga cgggaacggt 



aAccaaaaga gtgattctga tgaaaacaat 60 
agcagaaacc acattcatca tcaacaggga 120 
ggcagtgcaa aaagaaatga gcgtggcaaa 180 
agaaaacgtt ggagagattc cagaagactg 240 
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attttcattc ttggtgcatt cttaggtgta 
gttcataata gcgatagcga cttgtttgac 
tatttggatg attggaaaga tgttctccca 
caggctggta actactccac atcttcttta 
5 aaacaactct tacgtgatta taatatcgag 
gtcatttcta cgggaattga aagctgggga 
gcgcattttc gtaaacggct gtggggaagt 
aaagtttgtt ggttgaaaca tgtaatgtta 
tttacgctac gtgcagcaca gggcttcgaa 

10 atttggaaca aagttttcca aaatctggga 
agtgctgcgt atgattggag gcttgcatat 
acgaagctaa aggaacaaat cgaactgttt 
attggacatt ctatgggttc tcagattatc 
ggccctcttt acggtaatgg tggtcgtggc 

15 aatgcagcag ggacgcttct gggcgctcca 
atgaaagata ccattcaatt aaatacgtta 
agaattgaga gagtaaaaat gttacaaacg 
ggagaagagg tcatttgggg ggatatgaag 
actgacacat acggcaattt cattcgattt 

20 aatttgacaa tgaaagacgc cattaacatg 
agaagagtac atgagcagta ctcgttcggc 
aatgagctac .accacaagca ctggtcgaat 
cacatgaaaa tctattgtat atacggggtg 
aaggaagagg atgactcctc tgctctgaat 

25 gtattcctca ccgaggggga cggaaccgtt 
tgggcccagg gtgcttcacc gtacaaccct 
aaacaccagc cagatcgatt tgatatacgt 
atcctcggca gcgcggagtt gaacgattac 
ctcgtcgagc cacgccaatt gtctaatttg 

30 atgtaa 



cttttgccgt ttagctttgg cgcttatcat 300 
aactttgtaa attttgattc- acttaaagtg 360 
caaggtataa gttcgtttat tgatgatatt 420 
gatgatctca gtgaaaattt tgccgttggt 480 
gccaaacatc ctgttgtaat ggttcctggc 540 
gttattggag acgatgagtg cgatagttct 600 
ttttacatgc tgagaacaat ggttatggat 660 
gatcctgaaa caggtctgga cccaccgaac 720 
tcaactgatt atttcatcgc agggtattgg 780 
gtaattggct atgaacccaa taaaatgacg 840 
ttagatctag aaagacgcga taggtacttt 900 
catcaattga gtggtgaaaa agtttgttta 960 
ttttacttta tgaaatgggt cgaggctgaa 1020 
tgggttaacg aacacataga ttcattcatt 1080 
aaggcagttc cagctctaat tagtggtgaa .114 0 
gccatgtatg gtttggaaaa gttcttctca 1200 
tggggtggta taccatcaat gctaccaaag 1260 
tcatcttcag aggatgcatt gaataacaac 1320 
gaaaggaata cgagcgatgc tttcaacaaa 1380 
acattatcga tatcacctga atggctccaa 1440 
tattccaaga atgaagaaga gttaagaaaa 1500 
ccaatggaag taccacttcc agaagctccc 1560 
aacaacccaa ctgaaagggc atatgtatat 1620 
ttgaccatcg actacgaaag caagcaacct 1680 
ccgctcgtgg cgcattcaat gtgtcacaaa 1740 
gccggaatta acgttactat tgtggaaatg 1800 
ggtggagcaa aaagcgccga acacgtagac 1860 
atcttgaaaa ttgcaagcgg taatggcgat 1920 
agccagtggg tttctcagat gcccttccca 1980 

1936 



) 



<210> 76 

<211> 661 
<212> PRT 

<213> Saccharorayces cerevisiae 
35 <400> 76 

Met Gly Thr Leu Phe Arg Arg Asn Val Gin Asn Gin Lys Ser Asp Ser 
1 5 10 15 

Asp Glu Asn Aen Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg 
20 25 30 

40 Asn His lie His His Gin Gin Gly Leu Gly His Lys Arg Arg Arg Gly 
35 • 40 45 . 

He Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg 
50 55 60 . 

* Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu 
45 65 70 75 80 

He Phe He Leu Giy Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe 
85 90 95 



wo 01/16308 PCT/USOO/23863 

46 



Qly Ala Tyr His Val His Aan Ser Asp Ser Asp Leu Phe Asp Asn Phe 
100 105 110 

Val Asa Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val 
115 120 125 

5 Leu Pro Gin Gly He Ser Ser Phe He Asp Asp He Gin Ala Gly Asn 
130 135 140 

Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly 
145 150 155 160 

Lys Gin Leu Leu Arg Asp Tyr Asn He Glu Ala Lys His Pro Val Val 
10 165 170 175 

Met Val Pro Gly Val He Ser Thr Gly He Glu Ser Trp Gly Val He 
180 185 190 

Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp 
195 200 205 

15 Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp 
210 215 220 

Leu Lys His Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn 
225 230 235 240 

Phe Thr Leu Arg Ala Ala Gin Gly Phe Glu Ser Thr Asp Tyr Phe He 
20 245 250 255 

Ala Gly Tyr Trp He Trp Asn Lys Val Phe Gin Asn Leu Gly Val He 
260 265 270 

Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu 
275 280 285 

25 Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys 
290 295 '300 

Glu Gin He Glu Leu Phe His Gin Leu Ser Gly Glu Lys Val Cys Leu 
305 310 315 320 

He Gly His Ser Met Gly Ser Gin He He Phe Tyr Phe Met Lys Trp 
30 325 330 335 

Val Glu Ala Glu Gly Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val 
340 345 350 

Asn Glu His He Asp Ser Phe He Asn Ala Ala Gly Thr Leu Leu Gly 
355 360 365 



J 



35 Ala Pro Lys Ala Val Pro Ala Leu He Ser Gly Glu Met Lys Asp Thr 
370 375 380 



He Gin Leu Asn Thr Leu 
385. 390 



Ala Met Tyr Gly Leu Glu Lys Phe Phe Ser 
395 400 
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<210> 77 

<211> 35 

<212> DNA 

<213> Artificial 



Sequence 



5 <220> 

<223> Description of Artificial Sequeiice: Synthetic 
oligonucleotide primer 

<400> 77 

ggatccgcgg ccgcacaatg ccccttattc atcgg 



10 <210> 78 
<211> 35 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

15 <223> Description of Artificial Sequence: Synthetic 
oligonucleotide primer 

<400> 78 

ggatcccctg caggtcacag cttcaggtca atacg 



<210> 79 
20 <211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Synthetic 
25 oligonucleotide primer 

<400> 79 

ggatccgcgg ccgcacaatg ggcacactct ttcgaag 37 



<210> 80 
<211> 39 
30 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide primer 



35 <400> 80 

ggatcccctg caggttacat tgggcacact gtttcgaag 



39 
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